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Tribal gathering 


Before they can construct a cell, researchers in synthetic biology must first 


build bridges between disciplines. 


Ithough ‘-ology’ denotes a subject of 
As synthetic biology is more about 
building than observation. The field’s 
founding papers were published just 14 years 


BEYOND DIVISIONS 


The future of synthetic biology 
nature.com/synbio 


parts are freely available and built-up com- 
ponents are eligible for patenting. Other gaps 
could be filled by technological and cultural 
innovations. Streamlined techniques can now 


ago, but the scientists who call themselves syn- 
thetic biologists already have disparate goals. By many accounts, 
‘synbio is less a coherent discipline than a collection of tribes under 
the same name. They do not interact with each other nearly enough. 

Some synthetic biologists design genetic circuits to make microbes 
do fun or useful tricks — blink in sync, count pulses of chemicals, 
pump out biofuels or detect environmental pollutants. Others revise 
the chemistry of life's construction materials, crafting DNA and 
proteins that contain non-natural building blocks. Still others stitch 
together stretches of DNA to create whole chromosomes. In 2010, 
scientists replaced a bacterium’s genome with a synthetic replica. In 
2014, a team fabricated a next-generation yeast chromosome, drop- 
ping cumbersome sequences and inserting convenient ones. 

And those who influence the field with calls for funding and regu- 
lation are also driven by different goals. They agree that the outcomes 
of synthetic biology cannot yet be predicted, but some put caution 
foremost, seeking to avert any disasters that might result from an 
engineered microbe, whereas others focus on rewards, imagining 
revolutions in energy, medicine, nutrition or manufacturing. 

So far, the field has avoided many of the public-image pitfalls that 
beleaguered genetically modified organisms (GMOs). Today, the 
term GMO is identified more with patented crops that help agri- 
culture companies to sell products than with potential tools to grow 
food more efficiently or generate life-saving drugs. The best known 
application of synthetic biology, by contrast, is probably the manu- 
facture of an accessible life-saving malaria medicine. 

Although synthetic biologists need to address public concerns, 
they must also tend to the field from within, by fostering connec- 
tions across disciplines. Divisions run deep. A synthetic biologist may 
come to a project as an iterative tinkerer, a methodical engineer or 
an intuitive explorer. Engineering types can be flummoxed by unpre- 
dictability, and sciencey types frustrated by demands for inflexible 
definitions and standards. 

Some researchers have focused on establishing standards so that 
various ‘gear parts’ can be easily interchanged and combined. Others 
resist, arguing that variability among cells and circuits defies human- 
devised specifications. Some scientists push for open access, the bet- 
ter to mix and match inventions; others call for strong protections for 
intellectual property, the better to incentivize development of useful 
applications. The paucity of connection between these groups has 
provided ample fodder for sociologists of science. 

As part of this week’s special issue, a News Feature on page 152 sug- 
gests a way forward: open access and the safe-guarding of intellectual 
property could work together in a ‘diverse ecology’ in which circuit 


characterize components across many contexts, 
including different cell types and combinations. These are helping to 
bring some predictability to the bewildering variability that is inher- 
ent in a living cell. Tellingly, one of the best known synthetic-biology 
companies, Amyris in Emeryville, California, combines randomized 
and rational approaches. It screens both randomly generated yeast 
mutants and designed strains every week to incrementally improve 
yields of a chemical product. 

Synthetic biologists suggest that the various camps fail to con- 
nect because of apathy and habit rather than outright hostility. 
Whatever the reason, the discipline suffers from its divisions. Asked 
how to overcome the obstacles facing synthetic biology, experts call 
for tighter integration between disciplines 


“There will and cultures (page 155). Means to this end 
probably always might include the annual International 
be many tribes Genetically Engineered Machine (iGEM) 
in synthetic competition or “build-a-genome’ courses, 
biology, but which produced the first complete syn- 
there is reason thetic yeast chromosome earlier this year. 
to believe the Although these programmes arose from an 
field can avoid open-access tribe within synthetic biology, 


they reach undergraduate and even high- 
school students — nascent scientists not yet 
ensconced in a specialty. Perhaps they will bring new and useful kinds 
of thinking into the field. 

Establishment of an international synthetic-biology society might 
be another way to foster productive mingling while allowing the field 
to maintain itselfas a coherent discipline. Some encouraging develop- 
ments have been evident at recent conferences, with panels com- 
prising members from various camps. And more research is needed 
to learn how, for instance, intellectual-property and open-access 
schemes can be combined to boost innovation. 

Although there will probably always be many tribes in synthetic 
biology, there is reason to believe that the field can avoid tribal- 
ism. Indeed, it is now more vital than ever that synthetic biologists 
present a united front. As Volker ter Meulen, co-chairman of IAP 
— the Global Network of Science Academies — writes in a World 
View on page 135, storm clouds are gathering on the horizon. Not 
everyone agrees that synthetic biology is a force for good, and that 
opposition has found its voice in a consultation for the global Con- 
vention for Biological Diversity. It is crucial, says ter Meulen, that 
the balancing voice of science is heard before false assumptions lead 
to the creation of onerous and unnecessary regulation. Everyone 
can agree on that. = 


tribalism.” 
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WORLD VIEW .ocnsicor non 


Time to settle the 
synthetic controversy 


If synthetic biology is to thrive, the world needs to decide now how the 
field should be regulated and supported, says Volker ter Meulen. 


r | he creation of an artificial yeast chro- 
mosome shows that synthetic biology 
is getting closer to what most scientists 

want: to be able to deliver benefits to society. The 


BEYOND DIVISIONS 


The future of synthetic biology 
nature.com/synbio 


from existing fields, such as genetic modifica- 
tion. This means that the work is not proceed- 
ing entirely without regulation, as some claim; 
much of it is, in fact, governed by existing rules. 


field has already found cheaper ways to produce 
medicines, and is making progress in applications from water purifica- 
tion to materials design. 

The topic is, however, controversial, and that is jeopardizing its 
promise. Environmental groups argue that it poses risks to health and 
the environment and have called for a global moratorium. We have 
been here before: exaggerated fears and uncritical acceptance of claims 
of the risks of genetic modification led to excessively cautious regula- 
tion and a block on innovation that not only slowed the development 
of new products, but also deterred basic science. 

The debate over synthetic biology is now 
entering a critical phase. The Conference of the 
Parties to the Convention on Biological Diver- 
sity (CBD) — the global framework that gov- 
erns the protection of biodiversity — is currently 
exploring possible restrictions and will clarify its 
position at meetings next month and in Octo- 
ber. But given the precedent of how the issue of 
genetically modified crops was handled, many 
scientists are worried that some policy-makers 
will take unsubstantiated concerns of environ- 
mental groups at face value and impose cumber- 
some and unnecessary rules. To prevent that, we 
need an objective, evidence-based and balanced 
assessment of the risks and benefits, both within 
and beyond the CBD. And that means that the 
voice of science must be heard. 

To kick-start the necessary dialogue, the IAP — the Global Net- 
work of Science Academies — has published a position statement (see 
go.nature.com/tmvhf8), drawn from the work of member institutes 
across Europe and the United States. We (I am co-chair) hope that 
the statement will spur debate on how best to feed scientific evidence 
that emerges from peer-reviewed research into the development of 
policy. This includes the regulations for overseeing research and inno- 
vation, as well as, for instance, investment in relevant infrastructure 
and training. 

The IAP represents 106 academies worldwide and wants to have a 
more active role in global policy issues; last year, for instance, we pub- 
lished a similar position statement on antimicrobial-drug resistance. 

In the case of synthetic biology, the world needs to commit to 
addressing several priorities. First, the scope of synthetic biology needs 
be determined. We describe it as the construction of customized bio- 
logical systems to perform new and improved functions, through the 
application of principles from engineering and chemical synthesis. 

The goal might be new, but many of the techniques are borrowed 


FUNDING 


BODIES ACROSS THE 


WORLD 


MUST ANTICIPATE 
THE 


POTENTIAL 


OF SYNTHETIC 
BIOLOGY. 


The use, release and movement across borders 
of genetically modified organisms, for instance, are covered by the 
application of the Cartagena Protocol on Biosafety. 

The recognition that key methods are already controlled is crucial, 
because it should defuse some of the public controversy about risk. 
Also important is striking the right balance between statutory regu- 
lation and self-governance by scientists and scientific bodies. (The 
IAP and others have published recommendations on how to develop 
individual and institutional codes of conduct.) 

A second aspect that must be considered more 
broadly is how the results of synthetic biology are 
owned and shared. The current situation reflects 
its mixed parentage from both the biosciences 
(with its tradition of patenting) and engineer- 
ing and software development (which embrace 
open sourcing and sharing). The announcement 
of how researchers worldwide worked to produce 
a synthetic yeast chromosome shows how open- 
ness can pay off in academia. 

As synthetic biology progresses, techniques 
and tools will inevitably be developed that are not 
covered by existing regulations. It is reasonable to 
assume, the IAP argues, that these will allow the 
research to be done with greater precision. More- 
controlled modifications to genetic sequences, 
cells and organisms will facilitate characteriza- 
tion and bring the prospect of reducing unex- 
pected and unwanted side effects. Future synthetic-biology techniques 
and products should therefore be easier to regulate, manage and audit 
than earlier, less controlled genetic-modification techniques. 

Finally, the LAP says that funding bodies across the world must 
anticipate the potential of synthetic biology and invest in the research, 
and in the researchers involved. The investment should also incorpo- 
rate projects in the social sciences and the humanities, which can, for 
instance, look at concerns about biologists ‘creating life’ and find better 
ways to communicate the issues. 

Together, these steps should help to ensure that policies on synthetic 
biology set out sensible practices to mitigate the risk that is inherent 
in any major advance, yet are flexible enough to encourage research 
and innovation. = 


Volker ter Meulen is co-chair of the IAP — the Global Network of 
Science Academies in Trieste, Italy, and past president of the German 
Academy of Sciences Leopoldina. 

e-mail: volker.termeulen@mail.uni-wuerzburg.de 
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Selections from the 
scientific literature 


RESEARCH HIGHLIGHTS 


White matter 
matters in autism 


A genetic mutation that causes 
defects in the insulation of 
neurons may contribute 
to the cognitive deficits 
seen in autism and other 
neuropsychiatric disorders. 
Pratik Mukherjee of the 
University of California in 
San Francisco and his 
colleagues used magnetic 
resonance imaging to study 
the brains of 23 children with 
a deletion in a region of the 
genome associated with such 
disorders. 
The authors found that 
in these children, the white 
matter, which helps signals 
travel between neurons, was 
structurally defective. The 
children scored lower on 
non-verbal IQ tests (such 
as visual recognition and 
memory) than those without 
the genetic deletion. 
J. Neurosci. 34, 6214-6223 
(2014) 


Tipping point for 
Antarctic melting 


Loss of a crucial ‘plug’ of ice 
in Antarctica could result in 
massive sea-level rise, with 
the discharge of a huge ice 
sheet. 

The Wilkes Basin in East 
Antarctica contains enough 
ice to raise global sea levels 
by 3-4 metres. Anders 
Levermann and Matthias 
Mengel at the Potsdam 
Institute for Climate Impact 
Research in Germany 
used topographic data and 
simulations of ice dynamics 
to investigate the basin’s 
vulnerability to climate 
change. They found that ifa 
key plug of ice — equivalent 
to 80 millimetres of global 
sea-level rise — were to be lost 


Sweat-pore fingerprint 


A polymer that is sensitive to sub-nanolitre volumes of water 
could be used to map sweat pores for fingerprint analysis. 
Jong-Man Kim and Chan Woo Lee at Hanyang University 
in Seoul and their colleagues developed a thin film made of 
caesium polydiacetylene that changes from blue to red in the 
presence of water. The authors found that a fingertip pressed 
on the blue film produced a pattern of red dots (pictured) that 
coincided with sweat pores and was visible using an optical 


microscope. 


The sensor technology could be used to identify people by 
their fingerprint even if only a small part of it was visible, the 


authors say. 


Nature Commun. 5, 3736 (2014) 


from the region, the Wilkes 
Basin ice sheet would be 
destabilized, creating a self- 
sustaining discharge of the 
whole basin over 10,000 years 
and leading eventually to 
global sea-level rise on the 
metre scale. 

However, the ice sheet would 
remain stable if the equivalent 
of less than 60 millimetres of 
sea-level rise were lost. 

Nature Clim. Change http://dx.doi. 
org/10.1038/nclimate2226 (2014) 
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VIROLOGY 


Antibodies for 
Middle East virus 


Two independent teams have 
identified antibodies that can 
vanquish a deadly virus first 
reported in the Middle East. 
Since it appeared in 2012, 
the Middle East respiratory 
syndrome (MERS) coronavirus 
has caused disease in at least 
261 people and killed nearly 
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100 in Asia, North Africa 
and Europe as of 26 April, 
according to the World Health 
Organization. There are no 
vaccines or drugs for the virus. 

Wayne Marasco at the 
Dana-Farber Cancer Institute 
in Boston, Massachusetts, and 
Ralph Baric at the University 
of North Carolina in Chapel 
Hill and their team identified 
seven human antibodies that 
recognize different parts of 
a key MERS coronavirus 
protein and prevent it from 
recognizing and entering 
human cells in culture. 

Another team, led by Lingi 
Zhang at Tsinghua University 
in Beijing, identified two more 
neutralizing antibodies. The 
molecules recognize different 
portions of the same MERS 
coronavirus protein and work 
together to block the virus’s 
ability to infect cells. 

If the antibodies prove 
to be safe and effective for 
human use, they could be 
administered to prevent or 
treat MERS coronavirus 
infections, both teams say. 
Proc. Natl Acad. Sci. USA 
http://doi.org/smr (2014); 
Sci. Transl. Med. 6, 234ra61 
(2014) 


METABOLISM 


Hunger marks 
offspring’s genome 


Male mice that are 
undernourished while in the 
womb sire offspring with an 
increased risk of metabolic 
disorders such as diabetes. 

Josep Jiménez-Chillaron at 
the Sant Joan de Deu Hospital 
in Barcelona, Spain, and his 
colleagues underfed pregnant 
mice and found that male 
offspring had a chemical 
modification on the Lxra 
gene, which regulates fat 
metabolism in the liver. 

This ‘epigenetic’ change 
was transmitted through the 
offspring’s sperm to the next 


J.-M. KIM ETAL. 
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generation. It was also specific 
to certain types of tissue, such 
as the liver and skeletal muscle. 
Both generations showed 
reduced expression of genes 
that synthesize lipids in the 
liver. This makes them prone to 
developing glucose intolerance, 
which can lead to diabetes. 

The results suggest that the 
epigenetic effects of maternal 
malnutrition affect both 
the offspring and the next 
generation. 

Cell Metab. http://doi.org/sms 
(2014) 


Drug resistance 
from manure 


Dairy-cow manure that is 
spread on farmland could also 
be spreading a wide variety of 
genes encoding antimicrobial 
resistance, reports a 
Connecticut-based team. 

Jo Handelsman at Yale 
University in New Haven 
and her team analysed DNA 
from five samples of manure 
from dairy cows (pictured) 
and found 80 genes froma 
variety of bacteria that encode 
proteins for resistance to a 
wide range of antibiotics. 
Among them was a new group 
of genes that are specific 
to cow manure and confer 
resistance to the antibiotic 
chloramphenicol, which is 
used clinically. The protein 
sequences deduced from the 
80 genes were, on average, only 
50-60% identical to known 
amino-acid sequences. 

Cow manure is commonly 
used asa crop fertilizer, and 
this could be one way in which 


yo 


resistance genes disperse 
beyond the dairy farm, the 
team suggests. 

mBio 5, e01017-13 (2014) 


Heavy neutrino 
may be dark matter 


‘Sterile neutrinos could be the 
source of a mysterious signal 
believed to be coming from 
dark matter in galaxy clusters. 

Standard neutrinos are 
thought to be too light to be 
candidates for dark matter. 
But Kevork Abazajian of 
the University of California, 
Irvine, calculated how the 
hypothetical sterile neutrinos, 
which theories suggest could 
have larger masses than 
standard neutrinos, might 
decay to produce X-ray 
emissions such as those seen 
earlier this year by the 
XMM-Newton and Chandra 
X-ray space telescopes. 

By factoring in the particles 
interactions in the early 
Universe, Abazajian shows 
how sterile neutrinos could 
also be better than other dark- 
matter models for explaining 
how mass is distributed in the 
Milky Way. 

Phys. Rev. Lett. 112, 161303 
(2014) 


MARINE BIOLOGY 


Sea life booms in 
hot climates 


Deep-sea ecosystems are 
sensitive to global temperature 
variations on shorter 
timescales than previously 
thought. 

Moriaki Yasahara at the 
University of Hong Kong 
and his colleagues took 

samples of a fossil-rich 

sediment core from the 
northeastern Atlantic 
Ocean to study variability 
in the abundance of deep- 
water species over the past 
20,000 years. Focusing ona 
scale of decades to centuries, 
the researchers found that 
abrupt glaciations such as 
the Younger Dryas cooling 
period some 13,000 years ago 
led to significant increases 
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SOCIAL SELECTION 


Popular articles 
on social media 


ENCODE debate revived online 


In the social-media age, scientific disagreements can quickly 
become public — and vitriolic. A report from the ENCODE 
(Encyclopedia of DNA Elements) Project consortium proposes 
a framework for quantifying the functional parts of the 

human genome. It follows a controversial 2012 Nature paper 
by the same group that concluded that 80% of the genome 

is biochemically functional (Nature 489, 57-74; 2012). Dan 
Graur, who studies molecular evolutionary bioinformatics at 
the University of Houston in Texas and is a vocal ENCODE 


2 « 


critic, weighed in on this latest report. ENCODE's “stupid 
claims” from 2012 have finally come to back to “bite them in the 
proverbial junk’, Graur wrote on his blog. The targets noticed. 
“Some people seek attention through hyperbole and mockery,’ 
says the report’ first author Manolis Kellis, a computer scientist 
at the Massachusetts Institute of Technology in Cambridge. 
“We should stay focused on the issues.” 

Proc. Natl Acad. Sci. USA 111, 6131-6138 (2014) 


Based on data from altmetric.com. 
Altmetric is supported by Macmillan 
Science and Education, which owns 
Nature Publishing Group. 


in biodiversity on the ocean 
floor within around 100 years. 
Weakening ocean-circulation 
patterns during these surface- 
cooling events resulted in 
warmer deep-ocean waters. 
This high sensitivity to rapid 
temperature variation shows 
how marine biodiversity could 
respond suddenly to human- 
induced climate change this 
century, the authors say. 
Global Ecol. Biogeogr. http://doi. 
org/sk7 (2014) 


Maize vulnerable 
to drought 


Yields of maize (corn) have 
increased in the central 
United States since 1995, but 
so has the crop’s sensitivity to 
drought. 

David Lobell from Stanford 
University in California and 
his colleagues studied yields 
of US maize (pictured) and 
daily weather data for between 
1995 and 2012, focusing on 
a measure of atmospheric 
dryness called vapour- 
pressure deficit. The authors 


> NATURE.COM 
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found that yields have been 
increasing overall, but in 

years of high vapour-pressure 
deficits, maize grew more 
slowly. The team says that the 
practice of packing plants close 
together could be boosting this 
sensitivity. 

Climate change could have 
larger negative effects on US 
maize crops than previously 
thought, because vapour- 
pressure deficit in the region is 
predicted to rise as the climate 
warms, the authors say. 

Science 344, 516-519 (2014) 
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SEVEN DAYS 


POLICY 


Rising resistance 
The world is heading for 

a “post-antibiotic era” in 
which drugs against common 
infections will cease to work, 
warns the World Health 
Organization. On 30 April, 
the agency called for a global 
surveillance system to track 
how antibiotics are being 
used and where resistance is 
cropping up. See page 141 for 
more. 


UK animal research 
Ina bid to improve 
transparency, the UK 
government is considering 
reforming a law about animal 
research conducted by the 
nation’s institutions. Current 
rules prohibit the Home Office 
from releasing any information 
that could be considered 
confidential, such as on the 
type of animal research or 

the institutions involved. 

On 1 May, the government 
launched a consultation on its 
proposal to repeal this blanket 
ban and replace it with a rule 
that protects information about 
“people, places and intellectual 
property”. See go.nature.com/ 
zijvk2 for more. 


UK spaceport 

The UK government is looking 
into building a commercial 
spaceport by 2018. The move, 
announced in a report on 

30 April, is part ofa raft of 
measures designed to boost 
the UK space industry. A 
committee will consider 
locations for the port and 
review regulation reforms to 
enable commercial space flight 
in the United Kingdom. 


Reef heritage 

The Great Barrier Reef, which 
lies off the coast of Queensland 
in Australia, narrowly missed 
being listed as ‘in danger’ by 
the committee that manages 
the list of world heritage sites. 


The news in brief 


More protection for Coral Sea 


One of the world’s largest marine conservation 
areas has been created in the Coral Sea by 

New Caledonia, a French territory more 

than 1,000 kilometres east of Australia. The 
government of New Caledonia said on 23 April 
that a management plan is in development 


The preliminary decision 
taken by the United Nations 
Educational, Scientific 

and Cultural Organization 
committee on 30 April is 
expected to be finalized 

at a meeting in June. The 
committee acknowledged 

that Australia has taken some 
measures to manage the 

reef, but expressed concern 
about developments such as a 
planned dumping of dredged 
material. It warned that the reef 
could still make the list in 2015 
unless it is further protected. 


RIKEN paper chase 
Pressure has intensified 

on a Japanese committee 
investigating misconduct in 
two stem-cell papers after 
whistleblowers accused three 
panel members of problems 
with their own work. This 
follows the panel's judgement 
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of misconduct last month for 
Haruko Obokata, of the RIKEN 
Center for Developmental 
Biology in Kobe (which she 

is appealing against), and last 
weeks resignation by the panel's 
head over alleged problems 
with two of his papers. Last 
week, RIKEN also made public 
that on 25 April, the institute's 
president Ryoji Noyori told all 
280 principal investigators to 
examine their papers; a RIKEN 
spokesperson notes that, 
collectively, these researchers 
publish more than 2,500 papers 
each year. See go.nature.com/ 
de7dxu for more. 


PEOPLE 


Stem-cell head 

The California Institute for 
Regenerative Medicine (CIRM) 
announced Randal Mills 

as its next president on 
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for the 1.3-million-square-kilometre marine 
park, but exactly how much of the area will be 
off-limits to fishing is not yet clear. Australia has 
already protected large swathes of the Coral Sea, 
which lies off its eastern coast and is connected 
to the Pacific Ocean (see Nature 495, 155; 2013). 


30 April. Mills is former 

chief executive of Osiris 
Therapeutics in Columbia, 
Maryland — the first company 
to commercialize a stem-cell 
drug. His appointment is seen 
as a bid to help the institute to 
deliver stem-cell treatments. 
Mills will begin in the post on 
15 May, replacing reproductive 
biologist Alan Trounson, who 
announced his departure 

last October. CIRM is one of 
the world’s largest stem-cell 
funding agencies. 


Society fellows 
The Royal Society, Britain’s 
national science academy, 
announced on 1 May the 
election of 50 new fellows, 
including England's chief 
medical officer Sally Davies. 
The society also announced 
ten new foreign members, 
among them Nobel physics 
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laureate and former US energy 
secretary Steven Chu. Just 
seven (14%) of the newly 
elected fellows are women, 
leaving the Royal Society 
trailing behind the US National 
Academy of Sciences, which 
two days earlier announced 

84 new members, of whom 
21% are women. See go.nature. 
com/dehfq9 for more. 


| FUNDING 
Research red tape 


Researchers at US institutions 
are overloaded with 
administrative tasks that are 
hampering their ability to do 
science, warns the National 
Science Board in Arlington, 
Virginia, in a report published 
on 1 May. The report surveyed 
thousands of federally 
funded scientists. It found 
that the most burdensome 
requirements include writing 
research grants and reporting 
progress. “Excessive and 
ineffective requirements 

take scientists away from 

the bench unnecessarily and 
divert taxpayer dollars from 
research to superfluous grant 
administration,’ says Arthur 
Bienenstock, chairman of the 
task force that produced the 
report. 


New-look spacesuit 


Martian-bound astronauts may 
one day sport spacesuits similar 


TREND WATCH 


The spread of wild poliovirus is 
a public-health emergency that 


requires a coordinated global 
response, the World Health 


Organization declared on 5 May. 


The agency said that people in 
countries currently spreading 


cases abroad (Pakistan, Cameroon 


and Syria) must be vaccinated 


to the prototype pictured. The 
bright blue styling unveiled 

by NASA on 30 April was the 
public’s favourite, winning 
233,431 votes (nearly two- 
thirds). Unlike previous 
generations of designs, this suit 
includes a rigid upper torso to 
protect the astronaut during 
ventures out onto the Martian 
surface; it also features more 
flexible hip and shoulder joints. 
The suit is expected to be ready 
for testing by November. It 

will be put through its paces in 
the gigantic pool in Houston, 
Texas, that NASA uses to train 
astronauts for spacewalks. 
Trips to Mars, however, remain 
decades away. 


| RESEARCH 
Element 117 


Scientists have created atoms 
of the superheavy element 117, 
they announced on | May, 
confirming the element's 
discovery four years ago by 

a different group. The latest 


POLIO-SPREAD FEARS 


experiments were conducted 

at the GSI Helmholtz Centre 
for Heavy Ion Research 

in Darmstadt, Germany 

(J. Khuyagbaatar et al. Phys. 
Rev. Lett. 112, 172501; 2014). 
The atoms, which were created 
by smashing together lighter 
elements, were first glimpsed at 
the Joint Institute for Nuclear 
Research in Dubna, Russia, 

in 2010. Official recognition 

of the discovery, anda name, 
await the approval of experts at 
the International Union of Pure 
and Applied Chemistry and the 
International Union of Pure 
and Applied Physics. 


New poxvirus found 


A virus related to smallpox 

has emerged in the Republic 

of Georgia. Epidemiologist 

Neil Vora of the US Centers for 
Disease Control and Prevention 
in Atlanta, Georgia, and his 
colleagues isolated a virus from 
two cow herders who became 

ill after handling sick animals, 
and discovered that it was not 
cowpox buta novel type. The 
researchers reported their 
results on 30 April at a meeting 
in Atlanta. Finding a new 
poxvirus that affects humans is 
rare, but this is probably because 
of alack of surveillance, the 
team says. 


| BUSINESS 
Sales suspended 


The maker of a device used 
to remove benign uterine 


Wild poliovirus is endemic in only three countries, but clinical cases have 
been confirmed in seven others in 2013-14. 


Syria 36 
f 
Israel: virus 
detected in 

Nigeria . [Sewage. 
‘| 55 cases 


poe ae 


“Afghanistan 17 
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before travelling; other nations in 
which polio is endemic (see chart) 
should encourage immunization. 
The measures were invoked under 
the binding International Health 
Regulations, last used in the 2009 
HIN1 influenza pandemic. 
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SEVEN DAYS | THIS WEEK | 


10-13 MAY 

The European Society of 
Clinical Microbiology 
and Infectious Diseases 
meets in Barcelona, 
Spain. The agenda 
includes antimicrobial 
resistance and viral 
infections in pregnancy. 
go.nature.com/n6wqzw 


19-24 MAY 

The general assembly 

of the World Health 
Organization takes place 
in Geneva, Switzerland. 
The meeting will elect 

a new president and 
discuss a global strategy 
to prevent tuberculosis. 
go.nature.com/myojx8 


growths has suspended its 
sales, following a warning 
that the device increases the 
risk of spreading undetected 
cancer. Johnson & Johnson of 
New Brunswick, New Jersey, 
announced the suspension 
on 29 April, after the US Food 
and Drug Administration 
recommended that physicians 
eschew the device. Known as 
a power morcellator, it is used 
to break up growths known 
as fibroids so that they can be 
more easily removed during 
surgery. See go.nature.com/ 
osoyjw for more. 


Pharma mega-deal 


Pharmaceutical giant Pfizer, 
headquartered in New York, 
has been rebuffed after 

a second attempt to buy 
London-based drug firm 
AstraZeneca, a merger that 
would create the world’s largest 
pharmaceutical company. On 
2 May, AstraZeneca rejected 
Pfizer’s improved bid of 

£63 billion (US$107 billion). 
But on 5 May, Pfizer's chief 
executive Ian Read said that 
he still intended to pursue a 
transaction. See go.nature. 
com/yqgki3 for more. 
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A tale of 


Antibiotics can be bought without a prescription in many countries, and overuse of the drugs is fuelling the evolution of resistant microbes. 


PUBLIC HEALTH 


Antibiotic resistance 
sweeping developing world 


Bacteria are increasingly dodging extermination as drug availability outpaces regulation. 


BY SARA REARDON 


y most standards, the increasing avail- 
B ability of life-saving antibiotics in the 

developing world is a good thing. But, 
around the globe, overuse of these drugs has 
created resistant strains of deadly bacteria — 
and they could be a greater threat in poorer 
nations than in richer ones, owing in part toa 
lack of regulation. 

Such concerns come amid a stark warning 
from the World Health Organization (WHO) 
on 30 April (see go.nature.com/c2l7ry). The 
world, it says, is poised to enter a ‘post-anti- 
biotic era. And in some places, that era has 


already arrived. In Nigeria, for instance, some 
studies suggest that as many as 88% of Staphy- 
lococcus aureus infections cannot be treated 
with methicillin — once a potent weapon 
against the microbe. The problem also seems 
to be particularly acute in the emerging econo- 
mies known as the ‘BRIC’ states: Brazil, Rus- 
sia, India and China, says Keith Klugman, an 
epidemiologist for the Bill & Melinda Gates 
Foundation in Seattle, Washington. 

Up to 95% of adults in India and Pakistan 
carry bacteria that are resistant to B-lactam 
antibiotics — which include carbapenems, 
considered to be antibiotics of ‘last resort’ — 
according to research by Timothy Walsh, a 
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medical microbiologist at Cardiff University, 
UK, that is due to be published in The Lancet. 
By comparison, only 10% of adults in the 
Queens area of New York carry such bacte- 
ria. The spread of resistance is “more than we 
could have imagined’, Walsh says. 

And these are only the microbial threats that 
researchers know about. The WHO’s report — 
which includes the first ever global map of anti- 
biotic resistance — notes that just 129 of 194 
member countries provided any national data 
on drug resistance (see ‘Spreading scourge’). 
And only 22 of those had tracked all of the nine 
bacteria—antibiotic pairs that the agency names 
as the greatest threats to public health. That 
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SPREADING SCOURGE 


Many countries lack reliable data to track emerging microbial threats, according to the World Health 
Organization. In large areas of the world, fewer than five antibiotic-resistant bacteria—drug pairs are monitored. 
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> list includes S. aureus and methicillin, 
Escherichia coli and cephalosporins, and 
Klebsiella pneumoniae and carbapenems. 

Such problems are compounded by a lack of 
new antimicrobial drugs in development. This 
is more than just a problem of future antibiotic 
supply: pharmaceutical companies’ clinical tri- 
als of antibiotics were once a valuable source of 
resistance data for public-health researchers, 
says Klugman. “Now that we live in a world 
where there are very few new antibiotics, there 
is almost none of that support.” 

As aresult, researchers do not know what 
factors have caused resistance to grow so rap- 
idly in developing nations. For instance, it is 
not clear to what extent the rise of resistance 
has been spurred by the use of antibiotics 
for growth promotion in livestock, or by the 
release of antibiotics into wastewater by drug- 
manufacturers in countries such as India. 


One likely culprit is a lack of sanitation. 
In many areas, wastewater from hospitals 
is poorly filtered, allowing the antibiotic- 
resistant bacteria that flourish there to escape 
into waterways. If people drink this contami- 
nated water or practise poor hygiene, the 
bacteria can spread. “If you get increased anti- 
microbial use but don’t have the infrastructure 
for infection control, you're setting yourself 
up for a tsunami of antibiotic resistance,’ says 
Klugman. 

Other problems include overprescribing or 
unregulated use of antibiotics. In China, for 
instance, hospitals and clinics receive finan- 
cial incentives for prescribing, and antibiot- 
ics are overused as a result. Some countries 
allow pharmacies to sell antibiotics without 
prescription — and people buy them even for 
diseases that antibiotics cannot treat, such as 
malaria. “It’s essentially destroying a valuable 


resource,” says Ramanan Laxminarayan, the 
director of the Center for Disease Dynamics, 
Economics & Policy (CDDEP) in Washington 
DC. Yet insisting that antibiotics be prescribed 
by physicians in nations where poverty is rife 
and visits to a doctor are a luxury would mean 
denying many people drugs that they do need. 

Meanwhile, in developed countries, inap- 
propriate prescribing can be a problem if phy- 
sicians bow to pressure from patients. Training 
pharmacists and educating the public on cor- 
rect use of antibiotics could go a long way 
towards curbing resistance around the world, 
Laxminarayan says. 

Solving these problems will require better 
monitoring of drug resistance, says Stuart 
Levy, a physician-scientist at Tufts University 
in Boston, Massachusetts, who heads the non- 
profit Alliance for the Prudent Use of Antibiot- 
ics. “You cant just go in and scoop up a sample 
and get results of resistance and think that’s 
the country-wide resistance rate,” he says. The 
WHO report proposes a new global surveil- 
lance network, although it is not clear who 
would pay for it. Levy says that hospitals, non- 
governmental organizations and local com- 
munities could help to track antibiotic usage 
and test for resistance with inexpensive, readily 
available screens. 

One thing is clear: new types of antibiotic, 
although sorely needed, cannot offer a solution 
on their own. They are likely to be expensive 
and unaffordable in the developing world. 
More importantly, says Sumanth Gandra, a 
CDDEP epidemiologist, it is all but inevita- 
ble that new drugs will eventually lose their 
potency, too. “We'll continue the cycle unless 
we have new drugs and approach the way we 
use them differently,’ says Gandra. m 


SOURCE: WHO 


NATURAL HAZARDS 


Avalanche hotspot revealed 


Study of disaster-prone Russian islands underscores perils of colonizing unfamiliar terrains. 


BY JANE QIU 


ussian writer Anton Chekhov noted 
R: 1895 that the strong blizzards on 
the island of Sakhalin, once home toa 
Russian penal colony, were a “friend” to the 
resident convicts, killing several soldiers. 
The blizzards are also the main cause of ava- 
lanches on Sakhalin and the nearby Kuril 
islands, and an analysis has now revealed the 
region to have one of the world’s deadliest 
avalanche records (see ‘Snow fall’). 
The death toll in a 100-year period on 
the Kurils and Sakhalin — a large island 
in the north Pacific Ocean — exceeds all 
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avalanche-related fatalities in the history of 
Canada or New Zealand. The study highlights 
the need for proper avalanche assessment and 
mitigation measures, such as snow fences. 
Until now, the extent of avalanches on the 
islands had remained elusive because of the 
their divided history, says Evgeny Podolskiy, 
a Russian avalanche researcher at the French 
National Research Institute of Science and 
Technology for Environment and Agriculture 
near Grenoble, who led the analysis. Fought 
over by Japan and Russia during the twentieth 
century, Sakhalin and the Kuril islands — an 
archipelago that forms part of the ‘ring of fire’ 
a band of intense volcanic activity encircling 
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the Pacific — had no continuous avalanche 
record. That “has largely hindered the efforts 
to mitigate avalanche risks’, says Podolskiy. 

The findings also have implications for 
the migration of people to mountainous 
terrains, especially in developing countries, 
say Podolskiy and his colleagues, who pub- 
lished their study last month in the Journal 
of Glaciology (E. A. Podolskiy et al. J. Glaciol. 
60, 409-430; 2014). 

Despite a legacy of limited information 
exchange between Russia and Japan, Podolskiy 
was determined to piece together a complete 
history of avalanches on the islands. After 
talking to colleagues in Japan, where he did 


SERGEY KRASNOUHOV/RIA NOVOSTI 


J. S. KIM ETAL. NATURE 


his PhD, Podolskiy found a rare archive. 
Kaoru Izumi, an avalanche scientist at Niigata 
University, had a collection of newspaper arti- 
cles on avalanches that occurred between 1910 
and 1945 on the islands, when they were under 
Japanese control. Izumi had hoped to one day 
work with Russian scientists. 

An astonishing picture emerged from 
combining and analysing the Japanese and 
Russian records. Podolskiy, Izumi and their 
team found that there had been at least 
756 fatalities and 238 injuries in 275 ava- 
lanches on the islands between 1910 and 
2010. The worst avalanche — the deadliest 
in Soviet history — killed 149 people on 
Sakhalin in 1945, when it buried a miners’ 
settlement at night. 

The study also highlights the danger of small 
avalanches, says Karl Birkeland, a researcher 
at the US Forest Service National Avalanche 
Center in Bozeman, Montana. He says that cat- 
astrophic avalanches do not come only in the 
form of “huge monsters coming down from 
high mountains over a long distance”. 

The avalanches that occurred on the islands 
were mostly on small, gentle slopes with verti- 
cal drops of 100-200 metres. It is surprising 
that this kind of topography could produce 
such deadly avalanches, says Birkeland. 

Mountains on Sakhalin and the Kuril 
islands are less than 1,600 metres high, and 
most settlements are concentrated in coastal 
areas with gently rolling topography. But “the 
innocent-looking slopes are deceptive’, says 
Podolskiy. The high latitudes and maritime 
climate mean that the winter is very cold 


y 
“ig 


Kuril 


islands 

Sakhalin i 

! 

1 

A 

HINA 
J ARAN 
ae 
_ SNOW FALL 


Sakhalin and the Bari islands are some of the 
most eee places in the world. 


> 


MORE 
ONLINE 


VIDEO OF THE WEEK 


IN FOCUS | NEWS 


A train buried by an avalanche on the Russian island of Sakhalin in 2010. 


and humid, with heavy snow storms for 
five months a year. 

With a large amount of snow in a wind- 
swept terrain, “you don't need a big slope 
to have catastrophic avalanches”, says 
Kouichi Nishimura, an avalanche researcher 
at Nagoya University in Japan. Early settle- 
ments were hidden away from the wind under 
cliffs with ‘snow cornices, large overhangs 
formed by windswept snow. If these broke 
away and triggered an avalanche, the destruc- 
tion could be immense. 

The islands’ first colonizers “were not 
aware of the risks and built houses, factories 
and roads in avalanche-prone areas without 
proper protection’, notes Sergey Sokratov, a 
snow scientist at the Lomonosov Moscow State 
University in Russia. 

By comparing the avalanche data against 
population records, the researchers found that 
periods of high avalanche fatalities — from the 
1930s to the 1960s, for example, when the fatal- 
ity rate was as high as 60 per 100,000 people per 
decade — were associated with waves of popu- 
lation influx, rapid industrial development and 
large-scale deforestation. The team showed a 
similar correlation on the neighbouring Japa- 
nese island of Hokkaido, which was also rap- 
idly colonized in the early twentieth century. 
The trends underscore the importance of 


avoiding deforestation, particularly on slopes 
that are at high risk of snowslides. 

Although the decadal avalanche-fatality 
rate on Sakhalin has since dropped, and was 
about 1.1 per 100,000 people in 2000-09, 
“the findings are still very relevant” because 
a large part of the region’s settlements 
remain unprotected from avalanche risks, 
says Nishimura, who is in discussions with 
Russian colleagues to develop mitigation 
measures on Sakhalin. 

“Historical records are extremely valuable 
for avalanche mitigation,’ says Perry Bartelt, 
a researcher at the Swiss Federal Institute 
for Snow and Avalanche Research in Davos. 
In Europe, databases going back more than 
300 years are used in planning for disaster 
mitigation. Such information “is critical 
for calibrating numerical models to predict 
how far an avalanche would go at what speed 
and how much impact it would generate’, 
adds Bartelt. 

Sokratov feels that the study has much 
broader implications as a cautionary tale for 
people moving to new terrains. “Never rush 
to build roads and houses without analysing 
the risks,” he says. “With greater awareness 
and proper mitigation, many lives can be 
saved from snow avalanches and other natural 
hazards. = 
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Voters queue for the polls in Nagpur, India, as part of the ongoing general election. 


Indian elections 
fall flat on science 


Researchers concerned about country’s competitiveness 
as most party manifestos neglect innovation. 


BY SANJAY KUMAR 


s India nears the end of its mammoth 
A general-election process, research 

advocates warn that whichever party 
wins, the government must do more than pay 
lip service to science. 

Nearly 815 million people are eligible to 
vote in the staggered election, which started 
on7 April and will end with vote-counting on 
16 May. But with two of the three main par- 
ties making scant mention of science in their 
manifestos, researchers fear that Indian science 
will not get the boost it needs to become inter- 
nationally competitive. 

The manifestos of the ruling Indian National 
Congress (often referred to as Congress), the 
Bharatiya Janata Party (BJP) and the new Aam 
Admi Party (AAP) give mixed clues to what 
science policies they would pursue. 

Congress has supported science in the past, 
but makes few promises about research now. 
“Unfortunately, in the value system of both the 
government and our society at large, science is 
not at the top” — in fact, it may be at the bot- 
tom, says Chintamani Nagesa Ramachandra 
Rao, chairman of the Indian Scientific Advisory 
Council to the Prime Minister. 

Congress does vow to increase annual science 
and technology expenditure from the current 
0.9% of the gross domestic product (GDP) to 


at least 2%. But Raghunath Anant Mashelkar, 
former director-general of the Indian Coun- 
cil of Scientific and Industrial Research, notes 
that the 2% spending goal has been mooted by 
politicians from various parties since 1989. “We 
have still not crossed 1% for over two decades 
now, whereas others, such as [South] Korea, 
China et cetera are rapidly moving towards, or 
have already moved to, 3% and beyond in the 
same period” (See ‘Broken promises.) 

Ofthe major parties, the BJP has devoted the 
most space to science and technology, health, 
energy and environment in its manifesto. The 
formulation of the document was chaired by 


BROKEN PROMISES 


Successive Indian governments have pledged to 
boost science spending to 2% of the gross domestic 
product (GDP), but it has never reached even 1%. 
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physicist Murli Manohar Joshi, who was science 
and technology minister from 1998 to 2004, 
when the BJP was last in power. 

The manifesto talks of reclaiming India’s 
ancient position as a global leader in science, 
and of creating a productive environment for 
fundamental research. It promises, for example, 
to set up world-class centres for nanotechnol- 
ogy, brain research and thorium-reactor tech- 
nology for India’s nuclear-power programme. 

Being globally competitive is the main chal- 
lenge, says Rao. “We have to improve not just 
the quantity of output, but more importantly 
the quality of output in science,” he says, add- 
ing that India currently contributes no more 
than 1% of the top 1% of global research. 

“We have very few islands of excellence in 
science, sometimes with a very subcritical 
number of world-acknowledged experts,” says 
Mashelkar. 

The BJP manifesto also says that gains made 
by India during the party’s most recent stint 
in power — when it tested nuclear weapons 
— have been frittered away by Congress rule. 
“We will follow a two-pronged independent 
nuclear programme, unencumbered by foreign 
pressure and influence for civilian and mili- 
tary purposes, especially as nuclear power is 
a major contributor to India’s energy sector,’ 
it promises. 

Most opinion polls indicate that a coalition 
led by the BJP is likely to win the election with 
prime-ministerial candidate Narendra Modi. 
Some commentators are uneasy about this 
prospect. “The rise of Modi will spell a para- 
digm shift in Indian politics, with a tilt towards 
militant Hindu ideology or even explicit anti- 
minoritism, and will impact science adversely,’ 
says Pushpa Mittra Bhargava, former director of 
the Centre for Cellular and Molecular Biology 
in Hyderabad. In 2004, Bhargava lost a Supreme 
Court case to halt the BJP government's projects 
— led by Joshi — to introduce Vedic astrology 
and Hindu priestly rituals into universities. 

The AAP, which is contesting its first gen- 
eral election, has a vigorous anti-corruption 
agenda. Its candidates include leading anti- 
nuclear activists and many physicians, but its 
manifesto is largely silent on science and tech- 
nology. However, it does focus on management 
of natural resources such as coal and minerals, 
which have been at the centre of major corrup- 
tion scandals in India. The AAP advocates a 
pivotal role for communities, promising that 
their consent would be needed to exploit local 
resources, and that they would share in profits. 

Rao says that if India can make careful 
investments in particular areas of scientific 
research, it will be able to compete with coun- 
tries such as China and South Korea in terms 
of patents acquired and high-quality publica- 
tions produced. 

But he warns: “The competition being 
very severe, it is my feeling that if India does 
not catch up in the next 10-15 years, it may 
become impossible to do so later” m 
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BIOMEDICAL SCIENCE 


Universities seek to boost 
industry partnerships 


As drug companies shift focus, academia ramps up its role in bringing discoveries to market. 


BY ERIKA CHECK HAYDEN 


hen Matthew Shair discovered a 
protein involved in causing blood 
cancer — and asmall molecule that 


might disrupt that protein — he saw the mak- 
ings of a new drug. Facing years of costly work 
to prove its worth, Shair, a chemist at Harvard 
University in Cambridge, Massachusetts, did 
not seek funding from the usual sources, phar- 
maceutical companies and venture-capital 
firms. Instead, in 2012, he turned to a Harvard 
programme that supports preclinical research. 
It provided him with US$250,000 for prelimi- 
nary studies in mice and human cells, to deter- 
mine whether his drug justified undergoing 
clinical trials. 

“It gave us the freedom to do the experi- 
ments the way we wanted to do them, rather 
than the way a company might think we should 
do them,” he says. His experience may soon 
become more commonplace. Faced with the 
increasing reluctance of drug and venture- 
capital firms to fund early development work, 
academic institutions worldwide are seeking 
ways to support research that can transform a 
promising discovery into a marketable therapy 
(see ‘Capital cruncly). 

Funding for early drug-development 
research has increased over the past year. In 
April 2013, a $50-million donation quintupled 
the size of Harvard’s preclinical programme, 
which aims to turn a profit and invest that in 
more grants. And the University of Oxford, 
UK, last month launched a £1.25-million 
(US$2 million) development fund that allows 
private investors to support the commerciali- 
zation of discoveries through its business- 
development unit, Isis Innovation. 

Hybrid academic-industrial drug-devel- 
opment schemes are also on the increase. 
The Karolinska Institute in Stockholm is set 
to open a $100-million translational research 
centre funded by London-based drug firm 
AstraZeneca in July. A partnership between 
New York’s Memorial Sloan Kettering Cancer 
Center, Rockefeller University and Weill Cor- 
nell Medical College launched the $20-mil- 
lion Tri-Institutional Therapeutics Discovery 
Institute (Tri-I TDI), together with Takeda of 
Osaka, Japan, in October. And last month, the 
Scripps Research Institute in La Jolla, Califor- 
nia, announced that Johnson & Johnson of 
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CAPITAL CRUNCH 


US venture-capital funding for life sciences has stayed relatively flat over the past decade. In response, 
universities have developed new funding models to transform discoveries into marketable therapies. 


== Software 
5: es Industrial/Energy ~*) 
«= IT Services 


Venture-capital funding (US$ billions) 


1995 


1997 


1998 2001 2003 


New Brunswick, New Jersey, would be the first 
industry partner for its new ‘Scripps Advance’ 
drug-discovery institute. 

Drug companies and research institutes hope 
that the new programmes will kill two birds 
with one stone: replenish the pharmaceutical 
industry’s depleted pipeline of new drugs, and 
bring money to institutions that face shrinking 
public funding. “The more quality shots on goal 
you have, the more quality you get in terms of 
revenue, says Scott Forrest, vice-president for 
business development at Scripps. 

Backers of the programmes also hope to 
avoid the pitfalls that stymied the most recent 
wave of academic-industrial partnerships in 
the 2000s. Drug companies outsourced early 
drug-discovery work to academic researchers, 

often by paying for 


“We’re seeing a the option to com- 
drying up of the mercialize promis- 
basic-research _ing finds. But these 
enterprise, which arrangements did 
is going tocreate not yield many new 
a problem. ” drugs, because undi- 


rected basic research 

often did not synchronize with industrial goals. 
The latest partnerships are more focused: 
they explicitly funnel academic creativity into 
the pursuit of new therapies. Tri-I TDI, for 
example, aims to find drugs including those for 
diseases that industry has eschewed because 
they are unlikely to generate large profits, such 
as neglected tropical diseases and rare cancers. 
Projects will be selected by academic scientists 
and carried out by Takeda’s scientists at the 
Tri-I TDI partners facilities in Manhattan. “No 
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one gets to veto a project because it isn’t the right 
market size; says David Scheinberg, chair of 
Memorial Sloan Kettering’s experimental thera- 
peutics centre and a Tri-I TDI board member. 

Work at the AstraZeneca—Karolinska centre, 
meanwhile, is geared towards cardiovascular 
and metabolic diseases, such as diabetes. The 
venture is recruiting academic researchers to 
work alongside AstraZeneca scientists. “We 
really want to make new discoveries here that, 
in turn, could eventually possibly be drugga- 
ble,’ says Hans-Gustaf Ljunggren, the Karolin- 
ska’s dean of research. 

But such arrangements are still vulnerable to 
market forces. On 2 May, a bid by New York- 
based drug firm Pfizer to buy AstraZeneca 
was rebuffed by the company. However, it is 
not clear how the Karolinska centre would 
be affected should such a merger eventually 
go ahead. More broadly, some observers note 
that if academics and funders continue to shift 
towards more applied work, there is the risk of 
avoid being created in the basic research that 
underlies the quest for new therapeutics. Even 
key funding agencies such as the US National 
Institutes of Health are shifting towards more 
translational research. “We're seeing a drying up 
of the basic-research enterprise, which is going 
to create a problem,’ says Kenneth Kaitin, direc- 
tor of the Tufts Center for the Study of Drug 
Development in Boston, Massachusetts. 

To many institutions, and to young scientists 
such as Shair, shaping the development of their 
discoveries has become even more crucial at a 
time of scarce resources. “I love having a foot in 
both sides,” he says. “This is part of the future.” m 
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Movies of cancer cells 
in living mice show how 
streams of melanoma 
cells (green) invade 
skin tissue. 
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Methods for monitoring tumour cells in living 
animals are transforming our view of cancer. 


BY CORIE LOK 


ikala Egeblad was blown 

away when she made her 

first action film of tumour 

cells inside live mice. Until 

then, she had studied sam- 
ples on microscope slides, where the cells sat 
still, frozen in time. But seeing them ina living 
animal brought the cells to life. “You turn on 
the microscope and look in the live mouse and 
suddenly these same cells are running around 
like crazy,’ says Egeblad, a cancer researcher at 
Cold Spring Harbor Laboratory in New York. 
“Tt really changed my thinking.” 
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Increasingly, cancer researchers are embrac- 
ing the chance to spy on individual tumour 
cells in their native environment. In studies of 
static tissue cultures, investigators have to infer 
what cancer and other cells surrounding the 
tumour might be doing, and how they might 
be interacting. Tracking cancer in live animals 
over time — an approach called intravital 
imaging — puts those interactions on display, 
and allows biologists to zoom in on the small 
number of dangerous cells within a tumour 
that drive the disease or resist treatment. 

The technique is young, and labs are still 
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The cells are guided 
by structures such as 
muscle fibres (orange), 
nerve fibres (blue), 
collagen (grey) and 


blood vessels (red). 


working out how best to analyse the gigabytes 
of video data it generates. But the increasing 
use of intravital imaging over the past dec- 
ade has already helped researchers to piece 
together timelines for key cellular and molecu- 
lar events, such as the process by which tumour 
cells sneak into blood vessels. Such clues have 
yielded new hypotheses about how cancers 
grow, spread and resist treatment — informa- 
tion that could, for example, eventually enable 
drug developers to understand why some can- 
cer cells do not succumb to therapy. 

And ina video-obsessed culture, the imag- 
ing technique holds instant appeal. “When we 
show our movies, people fall out of their seats 
when they see how dynamic a tumour lesion 
can be,’ says Peter Fried] at Radboud Univer- 
sity Nijmegen in the Netherlands. “It’s a change 
in perception.” 


DEEP DIVE 

First used by cancer biologists in the late 1990s, 
intravital imaging involves focusing powerful 
microscopes directly onto exposed tissue in 
a live, anaesthetized mouse. More labs have 
adopted intravital imaging as technological 
improvements have made it possible to peer 
further into tissue — now as many as 20 cells 
deep — and to tease out fainter signals. A 
growing library of molecular markers has 
given researchers the ability to visualize up to 
eight different kinds of cells and structures, 
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including various immune-system cells and 
the endothelial cells that line blood vessels. 
“The markers and the microscopy technol- 
ogy make this a powerful combination,” says 
Frederic de Sauvage, vice-president of molec- 
ular oncology at the biotechnology company 
Genentech in South San Francisco, California, 
who has seen the technology in action. 

Putting these components together creates 
a comprehensive picture of cancer as a com- 
plex ecosystem of cells that migrate, prolifer- 
ate and interact. Although cancer researchers 
have long understood that cells in a tumour are 
genetically heterogeneous, intravital imaging is 
revealing how the behaviour of individual cells 
can also differ. For example, cancer cells may 
march in single file or collectively as a tight- 
knit group, depending on the type of tumour 
and its environment. 

One mysterious cellular behaviour that 
has landed in the sights of these microscopes 
is that of the macrophage, a type of immune 
cell that normally engulfs pathogens, removes 
dead cells and stimulates immune responses. 
Macrophages can incite immune cells to fight 
cancer, but more often they boost a tumour’s 
growth and spread. 

Intravital imaging studies showed that 
macrophages, along with tumour cells and 
endothelial cells, form a structure that pumps 
tumour cells into the bloodstream — a key step 
in metastasis. Working with rodents, research- 
ers led by John Condeelis at Albert Einstein 
College of Medicine in New York found that 
when macrophages come into contact with 
mammary tumour cells, the tumour cells 
become more invasive, degrading the protein- 
rich matrix around blood vessels and squeez- 
ing between the endothelial cells. Macrophages 
cause the endothelial cells to lose contact with 
each other, opening a hole in the vessel wall 
and allowing tumour cells to stream out of the 
tissue and into the bloodstream’. 

Condeelis’s team has shown that this ‘pump’ 
is present in human breast cancer. The group 
has also identified three molecular markers, 
one for each cell type in the structure, that 
indicate its presence in tumours. Ina study’ of 
60 people with breast cancer, individuals with a 
higher density of these pumps in their tumours 
were more likely to develop metastases in 
other organs. A start-up company, MetaStat in 
Montclair, New Jersey, has licensed this prog- 
nostic technology and is developing a test that 
predicts metastatic risk in people with breast 
cancer. The company hopes to have the test in 
clinical trials by the end of this year. Condee- 
lis’s group is also working on a probe to identify 
the pumps using magnetic resonance imaging, 
avoiding the need to take tissue biopsies from 
patients. 

Others are using intravital imaging to track 
cancer drugs in the body, and to explore why 
some drug treatments fail. Cancer biologists 
typically test the effect of chemotherapies 
in vivo by measuring changes in tumour 


growth and size in mice. Intravital imaging 
gives a more direct view, revealing which cells 
in a tumour take up the drugs, and whether 
those cells live or die. 

Egeblad and her team have made films of 
doxorubicin, a naturally fluorescent cancer 
drug, as it infiltrated mammary tumours in 
mice. They were surprised by the degree of 
variability — even within small regions of 
the tumour — in the amount of the drug that 
got into the cells, and in the number of cells 
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can take up to 15 times longer than making 
them, says Egeblad. Others note that software 
for quantitative-image analysis is limited, so 
many labs are writing their own programs. 
And intravital imaging continues to pose 
technical challenges. The technique can access 
only tissues near the surface, which makes it 
applicable to just a few tumour types, says de 
Sauvage. It has also been difficult to integrate 
intravital imaging with classical tools of molecu- 
lar biology, such as fluorescent biosensors used 


“When we show our movies, people 
fall out of their seats when they see 
how dynamic a tumour lesion can be.” 


that died. One important factor, they found, 
was the ‘leakiness’ of the blood vessels in the 
tumour’. Mid-stage tumours, which have more 
porous blood vessels than early- or late-stage 
tumours, were more sensitive to the drug. 
Compounds that boost the permeability of 
vessels could therefore improve the delivery 
of cancer drugs, suggests Egeblad. 


VIEWING WINDOW 

To capture films in live mice, researchers were 
initially restricted to a single imaging session. 
Ideally, they would like to watch tumours in 
the same animal over days or weeks to track 
longer-term changes. Many are adopting a 
technique that implants a glass coverslip in 
a frame in the mouse’ skin. These windows, 
which can provide views into areas including 
the brain, abdomen and mammary glands, 
allow investigators to image the same location 
in the same mouse many times over. The mice 
wake up after an imaging session and carry on 
as normal in their cages. 

Using the windows, a team led by Jacco van 
Rheenen at the Hubrecht Institute in Utrecht, 
the Netherlands, watched colorectal-cancer 
cells colonize the livers of living mice over the 
course of two weeks. Newly arrived cancer cells 
moved within small areas of the organ during 
the first few days, but by day five they had 
stopped migrating and were becoming densely 
packed. The team found that in mice in the 
early stages of tumour spread, treatment with a 
molecule that inhibits cell migration decreased 
the number of metastatic liver tumours that 
developed later’. 

As intravital imaging of cancer has matured, 
the field has moved beyond eye-catching films 
and has begun to generate quantitative data 
detailing, for example, the speed and direction 
of moving cells. Such data allow researchers to 
construct and refine mathematical models of 
cell behaviour. These could predict, for exam- 
ple, how tumour cells invade tissues, says Friedl. 

But generating such quantitative data is diffi- 
cult and time-consuming: analysing the movies 
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by researchers to see when and where cell-sig- 
nalling pathways are turned on. Many of those 
sensors work well in vitro — where cells can be 
manipulated to amplify changes in signalling — 
but are not sensitive enough to pick up the more 
subtle changes seen in vivo, says van Rheenen. 
This sort of information might provide clues 
to the molecular escape routes that allow some 
tumour cells to dodge the effects of cancer 
drugs, says Scott Powers, a cancer geneticist at 
Cold Spring Harbor. “It would be nice to know 
what's happening biochemically inside the cells 
that’s making them do what they’re doing” 
Egeblad is now integrating biochemical and 
genetic tools in her imaging work. She will soon 
be launching a new project to trace the history 
of different subsets of cells in mouse mammary 
tumours as they grow over several weeks. At the 
end of the experiment, her team will remove the 
tumours and sequence the genomes of individ- 
ual cells. The aim is to link genetic signatures 
to cellular behaviours, such as rapid growth 
or drug resistance, in different regions of the 
tumour. The team also plans to image the activ- 
ity of key cancer genes in mice as tumours grow. 
For Egeblad, the new project is a chance to 
return to the questions that first drew her to 
intravital imaging: how do different compo- 
nents of the tumour and its environment co- 
evolve? Powers says that working with Egeblad 
and watching her movies helped him to see 
how the tumour’s environment, not just its 
genetics, can influence cancer. “How could it 
not have an impact?” he says. “You're record- 
ing things that haven't been recorded before?” m 


Corie Lok is an editor for Nature in 
Cambridge, Massachusetts. 
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DIVISIONS 


Since the birth of synthetic biology almost 
15 years ago, the field has splintered into 
diverse tribes of scientists, all attempting 

to endow cells with new abilities. 


BEYOND DIVISIONS 


The future of synthetic biology 
nature.com/synbio 
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ntil just a short time ago, the boundaries of 
biology were set by nature. But those limits 
are receding. In the past decade, the field 
of synthetic biology has shattered the idea 
that living cells must stick to the biomolecular 
widgets that have evolved naturally. Synthetic 
biology — a more complex form of genetic 
engineering — allows researchers to assemble 
molecular gears on an entirely new scale. From 
such tools, biological freedom has emerged; 
even the rule that DNA’s alphabet consists of just 
four letters has been thrown out of the window. 

In this special issue, Nature surveys the messy 
landscape of synthetic biology and the various 
scientific tribes that are pushing it forward. 
Although the field is young, it is already divided 
into factions that foresee different futures. 

A News Feature on page 152 investigates the 
schism between those who advocate free-for-all 
tools, such as open-source repositories, and the 
groups that seek to protect the fruits of research 
using legal instruments such as patents (see also 
Editorial, page 133). Volker ter Meulen follows 
that theme on page 135, highlighting the threat 
of excessive regulation in the field. On page 155, 
experts diagnose the disciplines biggest chal- 
lenges and suggest solutions through building 
bridges across scientific and cultural divides. A 
News & Views Forum on page 166 debates the 
merits and drawbacks of engineering useful 
features into cells by fashioning genetic circuits 
directly or through successive rounds of screen- 
ing and selection. And on page 168, Daniel G. 
Gibson and J. Craig Venter describe the research 
power ofa synthetic yeast chromosome built to 
scramble genes and then probe which combina- 
tions can keep cells growing. 

Even the most basic building blocks of 
biology are being remodelled. On Nature's 
website, Denis A. Malyshev and co-authors 
describe a cell that propagates with unnatural 
nucleic acids in its DNA (D. A. Malyshev et al. 
Nature http://dx.doi.org/10.1038/nature13314; 
2014). As explored in an accompanying News 
& Views article, this kind of expanded genetic 
alphabet might eventually allow the regulatory 
functions of DNA to be more precisely engi- 
neered, or permit new types of amino acid to 
slip easily into designer proteins (R. Thyer and 
J. Ellefson Nature http://dx.doi.org/10.1038/ 
nature 13335; 2014). Two of Nature’s sister 
journals, Nature Methods and Nature Reviews 
Microbiology, are also exploring applications 
and advances in synthetic biology. Together, 
the package of reports and commentaries 
provides a broad overview of how synthetic 
biology will be building bridges and pushing 
limits for decades to come. m 
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CULTURAL 
DIVIDE 


Synthetic biology is facing a tug of war 
over whether to patent its discoveries or 
embrace open-source innovation. 


BY BRYN NELSON 


idea about how to cure breast cancer. He asks: what if volun- 

teer researchers, working cooperatively from their garages and 
bedrooms, could rival the efforts of multibillion-dollar pharmaceutical 
companies? 

His crowd-funded venture, the Pink Army Cooperative, is trying to 
do just that by tapping into open-source tools springing from synthetic 
biology — an emerging field that designs biological products using 
engineering principles and a modular approach. Since the coopera- 
tive launched in 2009, nearly 600 people have invested in it. The cost 
to join? A mere US$20. 

This radical idea faces considerable hurdles — but even so, it has 


A Canadian futurist named Andrew Hessel has an unorthodox 


= attracted plenty of attention from industry groups and the media. The 
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cooperative, launched by Hessel and two co-founders, hopes to start 
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cell-culture studies this year and is considering a therapeutic trial in dogs. 

Currently based at the software-design firm Autodesk in San Francisco, 
California, Hessel represents an increasingly impatient and outspoken 
faction of synthetic biology that believes that the patent-heavy intellec- 
tual-property model of biotechnology is hopelessly broken. His plan relies 
instead on freely available software and biological parts that could be 
combined in innovative ways to create individualized cancer treatments 
— without the need for massive upfront investments or a thicket of protec- 
tive patents. He calls himself a “catalyst for open-source synthetic biology”. 

This openness is one vision of synthetic biology’s future. Another is 
more akin to what happens at big pharmaceutical companies such as 
Pfizer, Merck and Roche, where revenues from blockbuster drugs fund 
massive research initiatives behind locked doors. For such businesses, the 
pursuit of new drugs and other medical advances depends heavily on pro- 
tecting discoveries through patents and restrictive licensing agreements. 

Tight controls on intellectual property are necessary to encourage 
promising medical developments, says the Biotechnology Industry 
Organization (BIO) in Washington DC, the sector’s dominant trade 
association. On its website, BIO calls intellectual property “imperative 
for innovation” around the world. “Societies that protect inventors with 
patents are the world’s most advanced — scientifically and technologi- 
cally,’ it says. 

How synthetic biologists resolve the conflict between open source 
and patent protection could determine whether the field delivers on its 
ambitious goal of transforming medicine, agriculture, energy, environ- 
mental remediation and other industries through precision engineering. 
“It’s not just return on investment,’ says Linda Kahl, director of the legal 
programme at the BioBricks Foundation, a non-profit organization in 
Cambridge, Massachusetts, that advocates for biological engineering in 
the public interest. “It’s not just commercial applications. It is also about 
doing good in the world.” 


TWO CULTURES 
Although its roots extend back to the early twentieth century, synthetic 
biology started sprouting as an organized field just over a decade ago. In 
2003, only 3 peer-reviewed articles listed in Elsevier’s Scopus database 
used the term synthetic biology; in 2013, more than 800 did. Last year, 
the field also marked one of its biggest developments. Capitalizing on 
a discovery by biochemical engineer Jay Keasling of the University of 
California, Berkeley, the Paris-based pharmaceutical firm Sanofi began 
large-scale production ofa partially synthetic form of the malaria drug 
artemisinin, which is normally derived from plants (see Nature 494, 
160-161; 2013). And more big advances are in the pipeline: at the Pacific 
Northwest National Laboratory in Richland, Washington, for example, 
researchers are creating synthetic fungal enzymes that can convert sug- 
ars from broken-down plant biomass into fuels and other industrially 
useful chemicals. 

From the start, the field has been an amalgam of disparate influences, 
each with different cultures of intellectual property. On one side sit soft- 
ware design and engineering, which introduced the idea of encoding 
desired functions in pieces of DNA and joining together a standard- 
ized set of biological widgets, much like bricks or Lego pieces. Software 
engineers also brought with them the philosophy of sharing their work 
using open, public registries or only lightly restrictive licensing agree- 
ments, such as copyrights. 

On the other side sit molecular biology and biotechnology, which 
supplied know-how about messy and unpredictable biological systems. 
They also brought the practice of patenting genes, molecules and tech- 
nical processes. Half of the papers published in Nature Biotechnology 
between 1997 and 1999, for example, were linked to a patent. “They 
came with different perspectives, different goals, and in some cases, 
different expectations,’ says Andrew Torrance, 
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June last year, in a case brought by the Association for Molecular Pathol- 
ogy against the company Myriad Genetics, the US Supreme Court ruled 
unanimously that “products of nature” such as genes and genetic markers 
are not eligible for patents. But by its very nature, Torrance says, synthetic 
biology creates DNA that does not occur naturally — and so the court’s 
ruling explicitly allows such human-designed DNA to be patented. 

Legally, therefore, synthetic-biology sequences and techniques can 
be patented, at least in the United States. But the morals and ethics of 
doing so are vigorously debated by researchers, companies, lawyers and 
bioethicists. 

Patent advocates say that protecting intellectual property is necessary 


to spur innovation. In a statement after the rul- 
“ITS NOT JUST 


ing against Myriad, Craig Venter, founder and 
RETURN ON 


chief executive of Synthetic Genomics in La 
INVESTMENT. 


Jolla, California, applauded the court for mak- 
ing a distinction between naturally occurring 
and human-derived DNA segments. “These 
man-made genetic constructs are already 
IT IS ALSO being used to create new vaccines, biofuels and 
nutritional products,” he said. “And the ability 
ABOUT DOING to protect this intellectual property is a neces- 
sary component of a vital and robust science 

G00 D | N TH F and biotechnology industry.” 
Many synthetic biologists are indeed pat- 
WORLD ” enting their work. Writing in Systems and 
' Synthetic Biology last year, a group of research- 
ers in Germany documented a trend towards 
increasing patent applications in the field — particularly in the energy, 
medical and industrial sectors (D. van Doren et al. Syst. Synth. Biol. 7, 
209-220; 2013). Lead author Davy van Doren, an emerging-technolo- 
gies researcher at the Fraunhofer Institute for Systems and Innovation 
Research in Karlsruhe, Germany, concedes that the trend is inferred 
from a limited number of patents and a short time frame — but says 
that it is consistent with other areas of biology. “We couldn't find any 
evidence that patent trends in synthetic biology might be different com- 

pared with other domains,” he says. 

But open-source advocates argue that patents squelch innovation. This 
opinion is widespread in start-up companies, non-profit organizations, 
graduate programmes and the wildly popular annual International Genet- 
ically Engineered Machine (iGEM) competition, in which university and 
school students compete to make synthetic systems that work in living 
cells. These researchers say that if companies and universities can patent 
key synthetic-biology tools and building blocks, they can charge hefty 
fees for others to use them, making it prohibitively expensive to create 
new products building on those discoveries — especially for start-ups 
and organizations with few resources. 


OUT IN THE OPEN 

The free-for-all synthetic-biology movement is advancing its aims by 
assembling public registries analogous to open-source software regis- 
tries. The iGEM Registry of Standard Biological Parts, the oldest and 
biggest, contains samples submitted mainly by teams that have entered 
the competition. By building up a critical mass of components in repos- 
itories dedicated to public use, Torrance says, sharing advocates are 
creating a commons that future innovators can rely on for synthetic- 
biology building blocks, dubbed biobricks. 

Individual parts from these collections can be incorporated into larger 
and potentially patentable inventions, but theoretically it is harder to pat- 
ent the basic parts if they are already in the public commons. The iGEM 
Registry alone is growing by a few thousand parts per year. But there is a 
big caveat: no one can say with any certainty how many of these parts are 
themselves entirely free of patent claims. Not 


a law professor at the University of Kansas in 

Lawrence who focuses on synthetic biology. 
Which intellectual-property culture will come 

to dominate synthetic biology is still unclear. In 
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all researchers are willing or able to verify that 

parts that they label ‘open-source’ actually are. 
To provide some clarity, the BioBricks Foun- 

dation has developed a legal tool known as 
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the BioBrick Public Agreement: a contract for acquiring standardized 
biological parts on an open-source basis. Contributors agree not to assert 
any existing or future intellectual-property rights on a biological part 
that they have developed, in exchange for a promise by users that they 
will give proper credit to the developer and abide by security regulations. 

Although public registries are rapidly expanding, few have the curato- 
rial capacity to verify that every part works as claimed. “You're not get- 
ting finished, high-quality pieces of DNA in every case,” Torrance says. 
Some researchers are more blunt. “There’s a lot of crap in there,’ says 
Keasling. To encourage sharing while 
weeding out the junk, he favours 
requirements for authors of journal 
articles to deposit standardized part- 
source data and descriptions into a 
repository — provided that the field 
can agree on a fair set of rules. 

Advocates say that the iGEM Reg- 
istry has improved vastly in recent 
years, and point to new collections 
that are emphasizing quality control. 
The Synthetic Biology Engineer- 
ing Research Center in Emeryville, 
California, is developing an Internet 
interface called the Web of Registries 
to stitch multiple databases into a 
linked system that offers uniform 
information about parts and their 
legal status. If the field takes off like 
open-source software, Torrance says, 
it may attract a do-it-yourself crowd 
that will help to verify that parts do what is advertised, just as some 
software enthusiasts currently spend their free time fixing bugs. 

Synthetic biologists on both sides of the debate say that few in the field 
take an absolutist view on patents. Many are instead homing in on the 
idea of a ‘diverse ecology’ — one that includes both intellectual-property 
protections and public-sharing agreements (J. Calvert BioSocieties 7, 
169-187; 2012). Complexity matters here: if the synthetic-biology build- 
ing blocks are compared to Lego, then in this situation the bricks would 
be free but a design for a complex rocket ship made of hundreds of Lego 
pieces would be patentable. 


SHARE AND PROTECT 

To give an idea of what a robust and commercially friendly open-source 
regime might mean for synthetic biology, Hessel points to the Linux 
computer operating system, the open-source platform that became so 
popular that it is now among the most widely used in the computer 
industry. Although the base operating system is free, developers have 
built proprietary businesses onto it, just like biotech companies might 
be able to incorporate free synthetic-biology building blocks into more 
sophisticated and patent-worthy systems. 

Ginkgo Bioworks in Boston, Massachusetts — which bills itself as the 
world’s first organism-engineering foundry — is part of an emerging 
class of synthetic-biology companies that have embraced both public 
and proprietary models. Among its many projects, Ginkgo is engineer- 
ing yeast cells to produce chemicals including flavours and fragrances, 
such as a designer rose extract. The company is not averse to taking 
out patents on such advanced creations, but it also has a stake in open- 
source science: Ginkgo sells a $253 kit for assembling biobricks from 
iGEM’s Registry of Standard Biology Parts into a multicomponent 
genetic system. And in 2011, the company agreed to make publicly avail - 
able one of its engineered constitutive promoters, a DNA regulatory 
segment that allows a gene to be continually copied into RNA. 

“Tt’s not particularly useful to be patenting individual parts, per se, 
except in very specific cases,’ says Ginkgo co-founder Reshma Shetty. 
“So I think having the commons available to everyone is a good thing 
for everyone.” 
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Although biobricks and other open-access parts have already shown 
their potential in research projects, advocates say that it is still too early 
to predict how much they will be accepted into commercial research 
and development. Ginkgo co-founder Tom Knight, a computer engi- 
neer at the Massachusetts Institute of Technology in Cambridge who is 
often called the father of synthetic biology, says that the field is shifting 
away from a focus on handcrafted biology, towards a system in which 
large-scale foundries create standardized parts en masse. If this shift 
continues, then companies will want to use small parts interchangeably 
and without complex patent agreements from every 
manufacturer. Patents will not disappear entirely, 
says Knight, but they might have a limited role in 
an industrial context. 

Even if the field evolves towards a middle way, the 
debate will continue. The biotech industry, for one, 
worries about liability issues associated with free- 
for-all biological parts. Ifa publicly available build- 
ing block is incorporated into a transgenic seed or 
medical treatment, for example, it is not obvious 
who is responsible for tracking down its provenance 
and demonstrating to regulatory authorities that the 
part is safe. 


INNOVATE OR DIE 

In the tug of war between patents and open-source 
registries, a nagging question remains: which mech- 
anism is better at driving innovation? In 2003, the 
US National Research Council issued a report called 
Patents in the Knowledge-Based Economy, which said 
that the evidence on how patents affected innovation 

was still “emergent” A decade later, the uncertainty persists. 

“There are a lot of people looking into this question,’ says Torrance. “It 
is amazing to me that in 2014, it’s impossible to point to a definitive study 
that indicates that the patent system or the copyright system is a net ben- 
efit or a net cost to the economy or to innovation. But that's where we are.” 

It may not matter what the data say, according to Hans Sauer, deputy 
general counsel for intellectual property at BIO. “For better or worse, 
we're just committed to a system that depends on the availability of 
patents, at least to some extent, for greasing the wheels that put the 
biotech business model in motion” In other words, whether patents 
actually spur innovation may be trumped by the widespread view of 
entrepreneurs and investors that they are key to minimizing risk in the 
start-up phase. “These people, rightly or wrongly, all act on their beliefs 
and their convictions,” says Sauer. 

Many in the biotech industry have difficulty imagining a world with- 
out patents, he adds. The industry is “a bit spooked” about emerging 
public hostility to patent rights, and about a legal pendulum that seems 
to be swinging towards a more restrictive application of patent law, at 
least in the United States. 

The hesitancy is understandable, says Hessel, given the biotech indus- 
try’s big investments and long lead times. But a new generation of nim- 
bler, leaner open-access types is not bound by such restraints. “What 
were seeing is a kind of transition era, where there’s this new community 
emerging and it’s in some ways competing intellectually with the cur- 
rent, established industry,’ he says. The old guard can go on worrying 
about downstream investment costs and liability, says Hessel, while he 
and his Pink Army invent a way to cure cancer. 

In March, Hessel spoke to pharmaceutical executives and consultants 
at the Pharma Summit 2014 in London about the Pink Army Coopera- 
tive’s cancer-therapy work. And BIO has invited him to join a panel 
discussion about emerging trends in biotechnology at the BIO Inter- 
national Convention in San Diego, California, in June. Although the 
cultural gap between the two camps remains wide, there are signs that 
the bridge-building has begun. m 


Bryn Nelson is a freelance science writer based in Seattle, Washington. 
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How best to 
build a cell 


Experts weigh in on the biggest obstacles in 
synthetic biology — from names to knowledge 
gaps — and what it will take to overcome them. 
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Bring in the 
biologists 


Professor of biomedical engineering, 
Boston University 


Synthetic biology is often described as 
bringing together engineers and biologists 
to build genetic circuits for some useful task. 
In fact, the field has engaged relatively few 
biologists. This is holding back its progress. 
We do not yet know enough biology to make 
synthetic biology a predictable engineering 
discipline. 

The engineering slant of synthetic biology 
has brought impressive accomplishments. 
These include whole-cell biosensors; cells 
that synthesize antimalaria drugs; and bacte- 
rial viruses designed to disperse dangerous, 
tenacious biofilms. 

To design these, engineers are trained to 
model systems as black boxes, abstractly 
linking inputs and outputs. They can often 
control a system with only a limited under- 
standing of it. But synthetic-biology projects 
are frequently thwarted when engineering 
runs up against the complexity of biology. 

Synthetic biology would benefit greatly 
from deeper insights into the mechanisms 
of biological systems. Such approaches have 
already yielded insights into how organized 
processes in cells work because of, rather 
than in spite of, noisy gene expression. Syn- 
thetic biology is also informing biology, 
helping to reveal how a gene product can 
amplify or inhibit its own expression and so 
allow cells to flip between stable states. Much 
more remains to be explored and discovered. 

Over the past decade, many engineers 
have become interested in ‘doing biology’. 
Although biologists need not feel compelled 
to ‘do engineering, biologists and engineers 
should learn to learn from each other. We 
must overcome cultural differences and 
biases. We must move beyond dismissive, 
tribal comments such as, “Well, she’s not 
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a biologist” or “That's not engineering”. 
This requires time, outreach and retraining. 
But the pay-off will be more contributions 
that are more meaningful, to both basic biol- 
ogy and biotechnology. 


Agree ona 
definition 


Head of biosciences strategic planning 
and development, Lawrence Berkeley 
National Laboratory 


Researchers, regulators, consumers and 
detractors in the United States cannot agree 
on whether synthetic biology is a new 
discipline of engineering or an exten- 
sion of biotechnology. Debates abound 
over whether geneticist Craig Venter 
created artificial life using synthetic 
biology, and whether that term is just 
another name for genetic engineer- 

ing. These ambiguities complicate 
discussions among scientists, hinder 
policy-makers, impede efforts to fund 
synthetic-biology research and thwart 
regulation that might build public 
confidence. 

Emerging technologies that have more 
precise definitions have captured signifi- 
cant federal support, in terms of funding 
and policies. For example, the Network- 
ing and Information Technology Research 
and Development programme, created by 
Congress in 1991, is slated to receive funds 
of about US$3.8 billion in 2015, while the 
National Nanotechnology Initiative, created 
in 2001, could receive $1.5 billion. 

The federal government already has an 
interest in synthetic biology. In its National 
Bioeconomy Blueprint, the administration 
of President Barack Obama included syn- 
thetic biology as one of a few technologies 
for which federal investments could yield 
economic outcomes through scientific 
advances. Practical applications spanned 
health, energy, the environment and food. 
In November 2013, Congress requested 
a briefing from the American Chemical 
Society about how synthetic biology might 
boost the economy and well-being, having 
previously asked in the 2012 appropriations 
bill that the Department of Energy submit a 
plan to assess the field. 

Without a consensus on what synthetic 
biology is and is not, informed policy is dif- 
ficult to set. If synthetic biology is the same 
as biotechnology, it is supported by other 
programmes, and so does not need a large- 
scale, federally coordinated effort. Congress 
is also unlikely to support the field if, as a 
March 2013 survey found, synthetic biology 
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is perceived by the public as “unnatural, 
man-made, and artificial” with as many risks 
as benefits (see go.nature.com/xlkfmk). 

US scientists should follow the example of 
the UK Synthetic Biology Roadmap Coor- 
dination Group, which published its report 
in July 2012 (see go.nature.com/yneivc). 
One year later, the UK science minister 
announced £60 million (US$101 million) 
in new support for the field. 


Build green 
‘bioalchemists’ 


Professor of biochemistry, University 
of Texas at Austin 


The biggest challenge for synthetic biology 
is how to extend beyond projects that focus 
on single products, organisms and processes. 
Right now, most applications engineer 
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bacteria that start a synthesis with glucose 
and turn out biofuels or fine chemicals, such 
as vanillin or artemesinin. A broader scope 
could help to build a ‘greener’ economy, in 
which more organisms make a greater range 
of chemicals. 

The chemical industry is a marvel of effi- 
ciency, taking raw materials such as oil and 
converting them into a wide range of prod- 
ucts, including plastics and pharmaceuticals. 
This is possible in part because feedstocks can 
be interconverted through various large-scale 
reactions for which catalysts and processes 
have been optimized over several decades. 

Synthetic biology could unlock the large- 
scale use of carbon sources from ligno- 
cellulose to coal. Synthetic ‘bioalchemy’ 
would reformat the basic elements of life to 
take advantage of abundant supplies of for- 
merly rare intermediates such as the nylon 
precursor adipate, which is used to synthe- 
size antibiotics. Metabolic engineering is 
already capable of syntheses that use glucose 
or other standard carbon sources as precur- 
sors, but the co-culture of synthetically 


modified organisms would make these 
processes more efficient. The ability to engi- 
neer photosynthetic organisms might even 
allow light to be used as the ultimate energy 
source and carbon dioxide as the ultimate 
carbon source. 

To bring engineered organisms into the 
chemical industry, the field must diversify 
the hosts and acquire a much deeper know- 
ledge of metabolism and ecology so that 
microbes can be designed to operate on feed- 
stocks much broader than the simple sugars 
they currently use. Many of the notions of 
standardization that have so far been the 
hallmark of synthetic biology will fly out the 
door. Manufacturing will come to rely ona 
new class of ‘broker’ who connects diverse 
feedstocks and organisms through new 
chemistries. Additional input: Hal Alper & Richard Murray 


MARTIN FUSSENEGGER 
Make tools for 
mammalian cells 


Professor of biotechnology and 
bioengineering, Swiss Federal 
Institute of Technology, Zurich 


For synthetic biology to be useful in medicine, 
more and better tools are needed for work 
with mammalian cells. The tools that are now 
standard for bacteria are missing or under- 
developed for mammalian cells. (Similar 
challenges exist for engineering plant cells.) 

To engineer bacteria, researchers can con- 
trol recombination in genomic segments and 
also craft large segments of DNA (such as 
episomes and artificial bacterial chromo- 
somes) to copy and carry out new genetic 
programs. For mammalian cells, making 
large chunks of DNA is a challenge, as is 
getting such DNA to behave properly in the 
cell. Most mammalian cells hold two copies 
of their genome, and expression within the 
genome is controlled in part by epigenetics: 
chemical markings on DNA and its associ- 
ated proteins. Mammalian cells also contain 
specialized structures called organelles, and 
are programmed to form organized assem- 
blies and to launch cell death. 

This means that mammalian cells can 
provide greater complexity than bacterial 
cells. Mammalian cells could be engineered 
to carry out a network of metabolic reac- 
tions. Similar to a prosthetic leg or hand, 
prosthetic networks would perform essen- 
tial functions. Implanted in the body, these 
designer cell communities would monitor 
disease-relevant metabolites in the blood 
and coordinate the production and release of 
a therapeutic compound as the body needed 
it. In mice, such systems have been success- 
ful at staving off gout and obesity. 


Ideally, prosthetic networks would 
constantly adjust to the body’s requirements, 
addressing the shortcomings of pills — 
inflexible units that give patients with varia- 
ble needs too high or too lowa dose. To make 
designer cells that produce the right amount 
of drug at the right time, researchers must be 
able to build better chunks of DNA, decorate 
them with appropriate epigenetic marks, and 
get both DNA and epigenetic modifications 
to persist even when cells divide. No matter 
how sophisticated the tools, bacteria will not 
be up to this task. 


RON WEISS 
Automate 
efficient design 


Professor of biological engineering, 
Massachusetts Institute of Technology 


As synthetic biologists build ever more 
genetic parts, efficient design has become 
a major stumbling block. In a system with 
three components, many of the possible 
combinations can be explored experimen- 
tally, so a design does not need to be perfect. 
However, once the number of components 
exceeds five or so, it often becomes imprac- 
tical to explore the relevant design space 
completely. Without a much greater effort, 
synthetic biology will be left with many clever 
parts, but only limited 


and inefficient waysto “A design tool 
combine them. should hide 

When computer messy innards 
programmers create [9 let scientists 
a software applica- focus on the 
tion, they donot write 9 gic behind 
every line of code aworking 
anew, nor do they system. ” 


select each machine 
instruction by hand. Instead, they reuse 
existing subroutines and libraries, and leave 
the job of selecting specific machine instruc- 
tions to a compiler program. 

Synthetic biology needs analogous tools. To 
make them will require better ways to char- 
acterize and tune individual parts across vari- 
ous conditions, such as a cell’s growth state. 
Characterization must go beyond simple ‘on’ 
or ‘off’ categories to reflect a range of activi- 
ties (see go.nature.com/hr8iu4) and to allow 
design tools to incorporate modules and parts 
that can function together. Such parts include 
insulating devices that prevent modules inter- 
fering with each other in undesirable ways or 
taking too heavy a toll ona cell’s resources. 

A design tool should hide messy innards 
to let scientists focus on the logic behind a 
working system, rather than on the physi- 
cal construction of its parts. For exam- 
ple, imagine a ‘biocompiler’ that receives 
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programming instructions for integrated 
modules and automatically selects compat- 
ible genetic components. 

Researchers working on such tools meet 
annually at the International Workshop on 
Bio-Design Automation. There are constant 
improvements in the approach to biological 
design challenges; still, much more progress 
is required. 


HERBERT SAURO 
Capitalize on 
evolution 


Associate professor of bioengineering, 
University of Washington 


Engineering life is not like engineering 
concrete or silicon. Bioengineers painstak- 
ingly craft a design, and a day later it has 
crumbled in the face of evolutionary selection. 
Synthetic biologists must learn to address this. 

Ideally, designs should function for as long 
as possible. For that, we can carefully engi- 
neer the DNA sequence and use microbial 
strains that are less susceptible to mutation. 
We can even construct dual-purpose DNA: 
read in one direction, it drives the engi- 
neered pathway; read in the other, it drives 
some essential process. Because mutations 
in the dual-purpose DNA will disrupt the 
essential process, the cell can survive only if 
the engineered circuit is also expressed. 

Another strategy borrows from nature. 
Long before human engineers, the bacterial 
virus phage ) evolved to stay undetected in 
its host for as long as possible. The virus sub- 
tly monitors its host by expressing a single 
protein at low levels so that the bacterium is 
barely affected. If the phage detects ill health 
in the host, it activates a program that ulti- 
mately kills the bacterium (and releases new 
phage particles). Engineers can use similar 
strategies to kill off cells if they are not per- 
forming as engineered. 

Conversely, should an engineered organ- 
ism escape the laboratory, we would like evo- 
lution to eliminate it. One could engineer a 
kill switch or even accelerate evolution so 
that engineered circuits are eliminated in 
the absence of some lab-specific conditions. 
But how these controls would fare in a natu- 
ral environment is not yet fully understood. 

Understanding engineered systems that 
evolve requires a marriage of disciplines that 
are not normally associated with each other, 
and grant panels and agencies are finding it 
difficult to judge these marriages. On the 
one hand, engineers may have little time for 
evolutionary theory; on the other, evolution- 
ists may have little interest in engineering. 
But scientists must consider both together to 
get the most from synthetic biology. = 
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Development in the city of Jiangyin, which lies close to the Yangtze River near China’s east coast. 


Realizing China’s 
urban dream 


Localimplementation and public scrutiny will make 
or break the government’s urbanization strategy, 
say Xuemei Bai, Peijun Shi and Yansui Liu. 


( “en is urbanizing at an unprece- 
dented rate. It is perhaps the greatest 
human-resettlement experiment in 

history. Between 1978 and 2012, the fraction 

of the nation’s population dwelling in cities 
increased from 17.9% to 52.6% (see ‘Urban 
expansion). If the current trend holds, 

China's urban population could top 1 billion 

people in the next two decades. 

These are uncharted waters, but China has 

a plan. In March, the government released 

the National New-type Urbanization Plan, 

which sets targets for China’s urban popula- 
tion fraction to rise by 1% a year to reach 

60% by 2020. 

The plan is comprehensive and ambitious. 

It covers almost every conceivable aspect of 

urbanization, from rural-urban migration 

and integration to the spatial distribution 
of and linkages between cities; sustainable 
development; institutional arrangements; 
and implementation. It sets numerical goals 

(see ‘Government targets’) and as a guid- 

ing principle emphasizes a sustainable and 
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people-centred approach, paying more 
attention to welfare and well-being — a 
significant and positive shift from the cur- 
rent economic focus on land development. It 
also aims to rectify existing problems associ- 
ated with the rapid urbanization in the past 
three decades. 

The right national strategy is necessary. But 
it is not sufficient. It is local practices that will 
make or break China's urbanization plan. 


GOVERNMENT GOALS 

President Xi Jinping and Premier Li Keqiang 
see sustainable urbanization as a policy pri- 
ority for three reasons. 

First, urbanization and industrialization 
are the two engines of modernization and 
economic growth. High-income countries 
typically have high urban population pro- 
portions — 70-90% for countries in the 
Organisation for Economic Co-operation 
and Development. By comparison, China’s 
urbanization lags behind its industrializa- 
tion. Research shows that urbanization does 
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promote economic growth in China’, even 
though the causal link between the two has 
been questioned’. 

Second, to keep its economy growing, 
China needs to shift from its current reli- 
ance on exports by spurring domestic 
demand. Urbanization enhances domestic 
consumption: per capita income in China is 
more than three times higher in urban than 
in rural communities. 

Third, urbanization can catalyse regional 
development. With less than 0.1 hectares of 
arable land per farmer, rural China has little 
room to boost income growth by expand- 
ing agriculture. Urbanization will reduce the 
poor agricultural population and the num- 
ber of smallholder farms, and open up pos- 
sibilities for accelerated rural development 
through large-scale, modernized farming 
practices. 

Development is at the core of these 
rationales, and the government aims to do 
it in a sustainable way. Building and oper- 
ating growing cities requires materials and 
energy, but the demand for some of these 
will exceed the country’s supply capacity’. 
And daunting environmental and social 
challenges caused by past urbanization 
need to be avoided. 

China aims to guide the development 
of urban areas according to their resource 
and environmental carrying capacity; 
improve land, water and other resource 
use efficiency; stabilize the employment of 
migrating workers; and promote green and 
low-carbon urban construction and opera- 
tion. Its 12th Five-Year Plan for 2011-15 
also aims to reduce construction land use 
per unit of gross domestic product (GDP) 
by 30%, energy intensity by 16% and carbon 
dioxide emission intensity by 17%. 

There is, of course, a catch: if the income of 
new urban residents matches that of existing 
ones, the consumption that will fuel China’s 
economic growth will present sustainability 
challenges to the country and other parts of 
the world. For example, the rise in CO, emis- 
sions that would result if China's household 
consumption patterns converge with current 
US levels could not be offset by efficiency 
gains alone’. Promoting low-carbon lifestyles 
is mentioned in the plan only in passing and 
without concrete measures or targets for how 
they might be achieved. 


POLICY CHALLENGES 

To live the urban dream, China faces three 
policy challenges: land, people and the envi- 
ronment. 

In the past decade, the urban built-up land 
area in China has grown by 78.5% — faster 
than its urban population, which grew by 
46%. The rapid pace of landscape urbani- 
zation is driven by Chinese cities’ over- 
reliance on income from land release’, which 
can amount to 70% of their total revenue. 
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SOURCES: 1, CHINA STATISTICAL YEARBOOK; 2, NATIONAL BUREAU OF STATISTICS OF CHINA 


Land-release income increased by a factor 
of six between 2005 and 2011. 

About half of urban growth has been at the 
expense of arable land, raising concerns about 
food security. To curb the loss of agricultural 
land, the central government has introduced 
strict regulations, mandating that cities offset 
developed land by securing the same amount 
of arable land elsewhere or reducing the area 
of land used for rural housing. 

These policies have had limited success, 
and in some cases, adverse consequences. In 
a desperate search for non-agricultural land 
to develop, cities have reclaimed wetlands 
and lakes and converted pristine moun- 
tains. At the same time, developed land is not 
being used efficiently — attention quickly 
shifts to the next development project. 

The flow of people from rural commu- 
nities to cities raises problems. Migrating 
peasant workers, currently estimated at 
250 million, are prevented from integrating 
into urban communities by the household 
registration system. This restricts the official 
changing of citizens’ permanent residence, 
and effectively limits those with rural regis- 
tration from gaining access to urban medi- 
cal insurance and other services. Migrants 
typically live in ‘urban villages; often former 
villages or new shanty towns on the edge of a 
settlement, without adequate social security, 
health care or schools’. 

Although forbidden, ‘forced urbanization 
is widespread. Rural residential lands 
are being expropriated with insufficient 
compensation and inhabitants forced 
into newly developed apartment build- 
ings. Peri-urban communities — those 
that straddle urban and rural areas — that 
are affected by expansion often become 
more vulnerable both socially and eco- 
nomically, with a sudden loss of tradi- 
tional livelihoods such as farming and 
fishing, and without adequate skills and 
access to other jobs. 

Neglecting migrants’ needs has enabled 
Chinese cities to grow at low cost. Now this 
strategy is backfiring. For example, the lack 
of security and sense of belonging of incom- 
ing workers in cities makes them reluctant 
to give up their rural land quota, which is 
left uncultivated. They invest their savings 
to build bigger houses back in the villages, 
often on agricultural land. Urban crimes 
and protests for appropriate pay by migrant 
workers are rising. 

Although rural populations are smaller 
than urban ones, and falling, rural residen- 
tial land use has increased rapidly. Four 
times more rural land than urban is being 
used for construction of all sorts, often inef- 
ficiently. More than 7.58 million hectares 
of potential arable land could be realized 
by optimizing rural residential land use — 
clearing up discarded residential plots’, for 
example. Integrating migrating workers into 


URBAN EXPANSION 


The proportion of China's population living in cities has risen steadily since the 1970s (1). Workers moving 
from rural areas to cities are prevented from integrating, however, by a household registration system that 
restricts them from officially changing their permanent residence. Urban populations concentrate in the 


eastern part of the mainland (2). 
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Industrialization of 
western China will raise 
incomes but could 
pollute major rivers. 


urban life — for instance, by guaranteeing 
access to education and providing water and 
sanitation — has costs, but is essential to 
achieving the state’s policy goals and main- 
taining social stability. 

Urbanization is also linked to environ- 
mental and health problems’. About two- 
thirds of Chinese cities are short of water, 
exacerbated by severe surface and ground 
water pollution. Air and water quality in 
cities and the surrounding rural areas are 
deteriorating, with increases in respiratory 
diseases and cancer rates*. In 2013, Beijing 
hit a high for the number of days with smog 
pollution — 46 of the first 100 days of the 
year. Illegal discharge of industrial waste- 
water, which contains heavy metals and 
other pollutants, around cities is blamed 
by some for more than 400 ‘cancer villages, 
in which cancer diagnoses and deaths are 
sometimes 20-30 times higher than the 
national average’. 

Most city development has been in the 
eastern part of the country, particularly 
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36% of the 
nation's land 
supports 96% 
the population. 


along the coast. The procession of urban 
and industrial development inland and 
westwards, to spread manufacturing across 
the country and reduce regional disparities, 
is clearly articulated in the government's 
strategy. But this could bring polluting 
industries into regions where fragile eco- 
systems are providing crucial services to 
the nation. Pollution in western China, for 
example, which is home to the sources of 
the Yangtze, Yellow and Lancang rivers, 
could have profound effects downstream. 

Environmental management strategies 
that stop at cities’ administrative boundaries 
are no longer effective, as epitomized by the 
frequent regional smogs. Officials need to 
identify good ways to collaborate across city 
and regional boundaries. 


LOCAL IMPLEMENTATION 

The National New-type Urbanization Plan 
aims to tackle some of these challenges. 
The household registration constraints 
will be removed or relaxed in towns and > 
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GOVERNMENT TARGETS 

Main indicators and numerical targets in China’s National New-type Urbanization Plan. 

Indicator 2012 2020 
Urbanization level 

Urbanization ratio (resident population) (%) 52.6 60 
Public services 

Proportion of peasant worker’s children accompanying parents receiving 299 
mandatory education (%) 

Basic social-security coverage for urban and township resident population (%)* 66.9 290 
Basic medical insurance coverage for urban and township resident population (%) | 95 98 
Infrastructure 

Public transportation share in total motorized travel in cities with more than 45+ 60 
1 million people (%) 

Public water supply coverage in cities and towns (%) 81.7 90 
Proportion of wastewater treated in cities (%) 87.3 95 
Proportion of municipal waste decontaminated in cities (%) 848 95) 
Broadband Internet connection capacity of urban households 4 250 
(megabits per second) 

Resource and environment 

Per capita urban land use (square metres) <100¢ 
Share of renewable energy consumption in cities and towns (%) 8.7 iS} 
Share of ‘green’ buildings in new constructions in cities and towns (%) Z 50 
Share of prefecture and above level cities that meet the national air-quality 40.9 60 
meeting standards (%) 


*Excludes resident students and people aged 0-16 years. {Data from 2011. +Standard is 65-115 square metres for urban 
built-up area, and 85-105 square metres in newly developed cities. Translated from the National New-type Urbanization Plan 


(2014-2020). 


> cities with populations of up to 5 million. 
Large investments are to be made to eradi- 
cate shanty towns and to provide basic infra- 
structure in the ‘urban villages’ 

A new land-management system will 
increase farmers’ property rights, prevent- 
ing undervalued or illegal land expropria- 
tion by local governments. It will restrict 
rural households from occupying more than 
one residential plot, putting a stop to rural- 
sprawl encroaching on arable land. Local 
governments will be rewarded with central 
funds for integrating rural populations into 
the urban system. 

The devil, however, is in the detail. Some 
of these central government reforms conflict 
with local interests, inviting malpractice. 
Increased duties of care for migrants and 
restricted access to land for development 
will increase the burden on cities and reduce 
their net revenue. 

There are many examples of cities 
behaving opportunistically. The city of 
Wuhan, for example, once known for its 
numerous lakes, has lost 70% of them to 
reclamation and development in the past 
20 years. The Huai River Pollution Con- 
trol Project, a high-profile, target-driven 
initiative led by the central government 
during the 1990s to clean up the heavily 
polluted river basin, failed when it collided 
with local income needs. The main meas- 
ures — shutting down polluting industries 
and imposing stronger regulations on 
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industrial pollutions — were relaxed after 
the target time frame. 

To put people at the centre of policies, 
leaders need to move away from numerical 
indicators such as GDP growth or urbani- 
zation level, which are often pursued at any 
expense. City leaders need to be encouraged 
and rewarded for adopting more responsi- 
ble and sustainable practices, such as taking 
care of migrating workers. 

Local officials must be made more 
accountable to deter malpractice. Better 
methods for evalu- 


ating personneland “Local 
channels for public officials must 
scrutiny of officials be made more 
actions, perhaps on accountable 
the Internet, needto fodeter 

be established. Seri- malpractice. a 


ous breaches of regu- 

lations must be punishable by law, instead 
of reprimanded with mere warnings and 
demotions in the party and the govern- 
ment system. 

Some cities are already doing well, 
exceeding the government's expectations. 
The city of Zhuhai, for example, has been 
spearheading the introduction of regu- 
lations and legal procedures for making 
and implementing urban-development 
plans, promoting public participation 
in the planning process and aligning 
urban-development goals with its social, 
economic, resource and environmental 
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context. Zhuhai topped the sustainable- 
development ranking in the 2013 Urban 
Sustainability Index of Chinese cities. 


WAY FORWARD 
Governments must not shirk the responsi- 
bilities of protecting farmer's rights; provid- 
ing proper social security and basic urban 
infrastructure; and ensuring better environ- 
mental quality and health outcomes in cities 
and surrounding regions. They must address 
risks and hazards such as flooding and smog. 
Rapid urbanization is an enormous 
opportunity to introduce sustainable tech- 
nologies and practices on a national scale. 
The latest plan, for example, aims to drasti- 
cally improve the share of green buildings in 
new construction from 2% in 2012 to 50% 
by 2020, by improving green building codes 
and certificate and accreditation systems, 
making their adoption compulsory, and 
developing green construction materials. 
Efforts must be made to identify innovative 
practices, build up transferable knowledge 
and facilitate cross-city learning”. 
High-level policy intervention, if done 
properly, can be a powerful force for good. 
But to succeed, a tailored and adaptive policy 
approach that engages with local social, envi- 
ronmental, economic and cultural contexts 
and allows local innovation will be crucial. 
The ultimate driver of urbanization is — 
and should continue to be — the aspiration 
of rural people to lead a better life. And that 
will be the ultimate measure of success. Urban 
residents, current and future, must have a say 
in shaping their cities. They are the ones who 
will give the verdict on, and live with, the 
outcome of China's urbanization policy. m 
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IAM/AKG-IMAGES 


Ernst Haeckel’s 1874 drawings track embryo development in pigs, cows, rabbits and humans. 


DEVELOPMENTAL BIOLOGY 


Life in flux 


Renee Reijo Pera enjoys a treatise tracking the rise of 
embryology, from Aristotle to cloning and beyond. 


tion (IVF) in 1978, embryology has 
rarely left the headlines. From cloning 
to regenerative medicine to legal wrangles 
over stem cells and mitochondrial DNA 
replacement, the science of how and when 
human life begins has a powerful fascina- 
tion. In Embryos Under the Microscope, Jane 
Maienschein reminds us that we have always 
argued about those questions. 
Her fascinating tour of them is a balanced 


sion the advent of in vitro fertiliza- 


combination of history and science. We track 
the thoughts of philosophers Aristotle and 
Descartes; follow the development of mod- 
ern experimental embryology by scientists 
such as Frank Lillie; and examine the twen- 
tieth- and twenty-first-century focus on 
understanding the molecular and genetic 
contribution of the sperm, egg and embryo to 
the offspring. Through this, Maienschein — 
director of the Center for Biology and Society 
at Arizona State University in Tempe, and of 
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the centre’s Embryo 
Project — interweaves 
the science of embry- 
ology and the many 
controversies that it 
continues to spark. 
The embryo, the 
product of the fusion 
of sperm and egg, was y 


hypothesized from Embryos Under 
ancient times. Pro- the Microscope: 
ponents of epigenesis The Diverging 
strongly believed that Meanings sd pb 
human life begins rie eats 
gradually, and that press: 9914 


the individual arises 

de novo from potent materials. In the fourth 
century BC, Aristotle became one of the 
theory’s earliest supporters. He was aware 
of some ‘facts of life’ such as that the men- 
strual cycle was linked to reproduction and 
that sexual intercourse involved the mingling 
of male and female fluids. An observer and 
biologist at heart, Aristotle watched chick 
embryos develop, seemingly from amor- 
phous materials to a recognizable form that 
emerged on hatching. As Maienschein makes 
clear, he instinctively understood that what 
he observed might be relevant to humans 
and other animals. Aristotle's ideas illustrate 
the scientific method at its best, in the face 
of great unknowns. Without the benefit of 
microscopy, he put forth concepts regarding 
the earliest stages of development that sur- 
vived for more than 1,500 years. 

The theory of preformation emerged in 
the seventeenth and eighteenth centuries 
along with microscopy. It posited that a pre- 
formed individual was present at conception 
but hidden from view, requiring activation. 
The advent of microscopy allowed scientists 
such as Caspar Friedrich Wolff and Charles 
Bonnet to see the earliest stages of develop- 
ment in many animals. This shifted embry- 
ology from a largely philosophical field to 
an experimental one. The earliest stages of 
species including the chicken, frog, dog and 
rabbit were examined. 

As Maienschein describes, embryology 
grew by leaps and bounds in the nineteenth 
century. Karl Ernst von Baer drew elegant 
pictures of chick-embryo development, and 
Ernst Haeckel compared the development of 
multiple organisms, concluding that perhaps 
“ontogeny recapitulates phylogeny” — devel- 
opment of the individual repeats the evolu- 
tionary history of the group or species. In 
1895, cell biologist Edmund Beecher Wilson 
published An Atlas of the Fertilization and 
Karyokinesis of the Ovum, documenting 
many aspects of fertilization and oogenesis 
and suggesting that in diverse organisms, the 
same types of cell may give rise to the same 
structures. 

Concurrently, genetics and developmen- 
tal biology emerged from embryology > 
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> as scientists sought to explain how 
development might be linked to chromo- 
somes and heredity, as outlined in stud- 
ies by Theodor Boveri and Walter Sutton. 
Further research by embryologist Hans 
Spemann and others probed how tissues 
form and what drives cells to develop 
into different kinds. In recent decades, 
researchers have been able to genetically 
modify organisms, track developmental 
fate and sequence DNA to find naturally 
occurring variants (insertions, deletions 
and even single-base-pair mutations) that 
affect embryo development in flies, fish, 
frogs, worms, mice and even humans. 

Findings in embryology led to a trans- 
formation in reproductive medicine with 
the advent of IVF through the work of 
Patrick Steptoe and Robert Edwards in 
1978. With that came a new focus on 
the value of human life, from the earli- 
est stages and throughout pregnancy. 
Concerns burgeoned, leading in the 
United States to the Dickey-Wicker 
Amendment of 1996, which prohibited 
the use of federal funds for research that 
would create or destroy human embryos. 
Controversies intensified around clon- 
ing technology, beginning in 1997 with 
the work of Ian Wilmut and his team 
on Dolly the sheep, and continuing in 
debates on the ethics of research into 
human embryonic stem cells and their 
potential in cell-based therapies for 
regenerative medicine. 

Embryos Under the Microscope does 
have gaps. It lacks material regarding 
epigenetic reprogramming — the estab- 
lishment of DNA and histone-protein 
modifications in the early embryo that 
allows embryonic cells with identi- 
cal DNA sequences to generate diverse 
cell types — which is a central event in 
embryology. Yet Maienschein covers 
broad territory with surprising depth 
and concision. It seems unlikely that a 
more readable text will soon emerge to 
illuminate the journey from theory to 
observation to ethical considerations in 
this exciting science. 

Lines from T. S. Eliot’s 1942 poem Lit- 
tle Gidding from Four Quartets resonate 
here: “And the end of all our exploring/ 
Will be to arrive where we started/And 
know the place for the first time.” We 
have made remarkable progress in our 
wonderful journey to understand the 
origins of our development. = 


Renee Reijo Pera is vice-president for 
research and economic development at 
Montana State University in Bozeman, 
and a professor in the departments of 
cell biology and neurosciences, and of 
chemistry and biochemistry. 

e-mail: renee.reijopera@montana.edu 
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Daniel Jones in Thetford Forest, UK, the first location for his installation Living Symphonies. 


Q&A Daniel Jones 


Canopy composer 


Sound artist Daniel Jones creates self-generating artworks based on human and natural patterns 
and processes. As he prepares to travel through four UK forests with the installation Living 
Symphonies, a collaboration with artist James Bulley, he talks about music that emerges from 
ecosystem dynamics, and works inspired by bacterial genetics and social networks. 


What is Living Symphonies? 

It is an ever-changing piece of music that 
grows in the same way as a forest eco- 
system, from the interactions of countless 
tiny elements. Each reflects the activity ofan 
individual organism, ranging from moss and 
fungi to deer and birds of prey. The outcome 
is an organic, emergent symphony comprised 
of thousands of musical motifs, each portray- 
ing a different aspect of the ecosystem. 


How do you build each forest installation? 
We start with a survey of the animals and 
plants. We add behavioural insights from 
ecologists: what times of day is a blackbird 
active and foraging, how does it move, what 
are its preferred food sources? We feed this 
information into a computer model of the 
ecosystem (programmed in C++ with the 
Cinder visualization library) that simulates 
the second-by-second 


interactions between _ Living Symphonies 
species. This model DANIELJONES AND 
is linked toa custom M&S SULLEY 

. . Next at Thetford Forest, 
piece ofaudio software 54 39 May 2014 


that orchestrates music — wwwzJivingsymphonies. 
inrealtime fromavast com 
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array of motifs representing each species. The 
music is played through a network of speak- 
ers in the canopy and undergrowth. 


How does the composition sound? 

The motifs for each organism are drawn 
from fragments that we composed and 
recorded with orchestral musicians. Our 
goal is a work with such a great number of 
interdependent elements, in a nearly infinite 
combinatorial space, that even we are sur- 
prised by the patterns and permutations that 
arise. This ‘emergent’ approach runs through 
all of my work. Emergent phenomena per- 
vade economics, ecology, linguistics and 
neuroscience; examples include the flock- 
ing of birds, and the interactions of neurons 
that give rise to cognition. By translating 
the dynamics of a forest ecosystem into 
music, Living Symphonies aims to heighten 
awareness of the adaptive and often creative 
behaviours of these complex systems. 


Have any animals responded to the piece? 

When we ran our first forest prototype 
last autumn, there was some worry that it 
would scare off the wildlife. Yet when we 
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India: shed the bad 
science image 


Contrary to Mathai Joseph and 
Andrew Robinson's implication, 
there are many joys associated 
with doing science in India 
(Nature 508, 36-38; 2014). 

For example, as a theoretical 
physicist, Iam free to pursue 
curiosity- driven science, I 
spend little time writing grant 
proposals and I do not have to 
raise money to fund my group 
members. This fills my foreign 
colleagues with envy. I suspect 
that these colleagues, many of 
whom have made productive 
scientific visits to India, would 
rank Indian science well above 
166th in the world on a different 
scale of research quality. 

Ialso disagree that we need 
“non-resident Indians” to 
partially staff an “empowered 
funding agency”: India already 
has one of the world’s largest 
scientific workforces. To me, this 
proposal smacks of colonialism, 
albeit in a new form, perhaps 
inspired by department closures 
and tenure abolitions in many 
Western universities. 

Finally, I dislike your portrayal 
of the Taj Mahal wrapped in red 
tape to convey the call to ‘Free 
Indian science. Thirty years ago 
a Nature cover also featured a 
Taj Mahal postcard image. That 
seemed inappropriate to me, 
even as an undergraduate, for a 
serious discussion of science in 
India (see Nature 308, 581-600; 
1984). 1am nowa professor and 
Indian science has arrived, along 
with an era of cultural sensitivity 
— but the iconography remains 
unchanged. 

Shobhana Narasimhan Jawaharlal 
Nehru Centre for Advanced 
Scientific Research, Bangalore, India. 
shobhana@jncasr.ac.in 


India: research split 
harmed universities 


One factor that Mathai Joseph 
and Andrew Robinson do not 
consider in their discussion on 
Indian science is the disastrous 


effect of uncoupling education 
and research in the country’s 
universities (see Nature 508, 
36-38; 2014). 

Several research institutes 
were created throughout India 
after it gained independence. 
These took away good students 
from the universities and made 
funding harder to come by, 
causing a decline in research. 

In turn, this diminished the 
number of science faculty 
members in universities and 
marred the spirit of education. 

This is the opposite result to 
that predicted by the celebrated 
Indian spectroscopist 
Chandrasekhara Venkata 
Raman. He declared in the 
1950s that the institutes would 
become “mausoleums of 
science”. He strongly believed 
that the best research could 
only be done by the universities 
(see P. Balaram Curr. Sci. 75, 
977; 1998). 

Biswa Prasun Chatterji 
StXavier’s College, Mumbai, India. 
biswaprasun@gmail.com 


India: overhaul 
university teaching 


We believe that the quality of 
education in India is another 
factor that has been hampering 
scientific progress in the 
country since independence 
(see M. Joseph and A. Robinson 
Nature 508, 36-38; 2014). 

Hundreds of thousands of 
students graduate from Indian 
universities each year. However, 
our own experience in selecting 
students indicates that many 
are ignorant of the basics, with 
underdeveloped reasoning skills 
and an inability to apply the 
knowledge they have. 

The inadequacy of teaching 
methods could explain why none 
of India’s universities is among 
the world’s top 200. Learning 
needs to be more enquiry-based 
and tutorials more demanding; 
syllabuses should be overhauled, 
upgraded and modernized; 
and, to improve accountability, 
student-feedback mechanisms 
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must become more effective. 

It is also important to 
emphasize the value of high- 
quality teaching skills when 
recruiting new faculty members, 
so that the students’ technical 
and reasoning abilities can be 
fully developed. 

Unless India reinvigorates its 
universities by reforming the 
teaching-learning process, it 
will keep failing to generate the 
workforce necessary to propel 
Indian science forwards. 
Dhruba J. Saikia Cotton College 
State University, Assam, India. 
vc@ccsu.ac.in 
Rowena Robinson Indian 
Institute of Technology, Guwahati, 
India. 


Aim for balance in 
Ukraine reports 


Your assessment of the 
scientific implications of 
tensions arising between Russia 
and the United States over the 
Ukraine situation seems to 
reflect a Western political bias. 
This could weaken the very 
research collaborations you 
seek to encourage (see Nature 
508, 149-150 and 162; 2014). 

The language you use to 
describe the Russian regime — 
for example, “Moscow’s violation 
of international law cannot 
be tolerated” and “Western 
observers are stunned by Putin's 
resolute pursuit of power” — 
would come as no surprise from, 
say, US or European media 
with a clear political leaning. It 
seems inappropriate from an 
‘international journal of science. 

A genuinely international 
perspective would not overlook 
the wider political narrative of 
countries such as China, India 
and Brazil, who did not join 
the immediate condemnation 
of Russia. 

Scientific advances in 
international research 
environments generally flourish 
despite cultural and political 
differences. They may not be 
so resilient if suspicion and 
hostility are aroused. 
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Andrew Isaac Meso CNRS 
and Aix-Marseille University, 
Marseille, France. 
andrew.meso@univ-amu.fr 


Help poster sessions 
to grab attention 


Research findings at 
conferences are now more 
often conveyed using posters 
than by oral presentation. Yet 
the development of posters as 
a medium has not kept pace 
with the steady increase in their 
popularity. 

Posters were first used 
to convey scientific results 
45 years ago at the 6th meeting 
of the Federation of European 
Biochemical Societies. Sessions 
were similar to today’s, but were 
shorter and had only about 
15 displays per session, so 
audience engagement was easier. 

The worldwide increase 
in poster presentation as a 
communication medium 
should benefit authors and 
users. Instead, poster authors 
complain of limited exposure for 
their work, resulting in a lack of 
kudos and restricted networking 
opportunities with other 
conference participants. 

Improving audience 
circulation, organizing sessions 
better and providing clear 
presentation guidelines would 
help the results on display to 
capture and hold attention. 
Nicholas Rowe University of 
Lapland, Rovaniemi, Finland. 
nrowe@ulapland. fi 


CORRECTION 

The Outlook article 
‘Neuroscience: Hardwired for 
taste’ (Nature 486, S7-S9; 
2012) incorrectly stated that 

L cells manufacture a glucose 
transporter: it is the absorptive 
intestinal cells that make this. 
Also, the article overlooked the 
contribution of the Shirazi- 
Beechey lab at the University of 
Liverpool in confirming sweet 
and umami receptors in the gut. 
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FORUM: Synthetic biology 


Engineering explored 


Synthetic biology involves the creation of biological systems for new applications by modifying and reassembling biological 
components. Two views are presented here on the best way to engineer these components so that they reliably generate 


organisms with desired traits. 


THE TOPIC IN BRIEF 

@ The aim of synthetic biology is to predictably 
bioengineer organisms that perform 
beneficial functions — from producing 
antibiotics to purifying contaminated water. 

@ One approach is rational design, which 
involves characterizing many biological 
components to generate a library of modules 
that can be assembled within an organism to 
give predictable, reliable outcomes (Fig. 1a). 


Rationalizing 
nature 


PAMELA A. SILVER & JEFFREY C. WAY 


ational design is a mainstay of synthetic- 
biology ideology. What does this term 
actually mean? As proponents of the rational- 
design approach, we believe that human 
designers can use nature to inform the creation 
of genetic circuits, proteins and even life forms. 
A human designer can learn from failures, 
which should ideally inform future designs. 
By contrast, evolution can be seen as a series 
of makeshift solutions, in which each change 
must work at some level for survival — non- 
working intermediates are not permitted. 
Rational design relies on the idea that bio- 
logical systems are fundamentally modular. 
The gene was originally defined as the basic 
biological unit. But the advent of technology 
allowing manipulation of DNA has revealed 
tantalizing levels of modularity that extend 
to many other cellular regulatory elements, 
including promoters, which control gene 
expression, and binding sites for the ribosome 
(the cellular machinery responsible for transla- 
tion), which stimulate protein production. The 
dream of the rational designer is to understand 
these modular parts in sufficient detail to be 
able to assemble them logically, much as an 
engineer would build a machine for a certain 
purpose. There are already success stories — 
for example, assembly of simple genetic cir- 
cuits that rely on the existence of two stable 
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@ An alternative approach is called directed 
evolution, in which genetic mutations of 
unknown impact are introduced into a 
target of interest, generating a library 

of mutants that is screened for desired 
characteristics. Iterative rounds of the 
process produce mutants with optimized 
traits (Fig. 1b). 

@ Opinions differ on which of these methods 
is the more effective. 


states in a system’. Recently, we used these 
principles to design a bacterium that can sense 
whether or not an animal has been exposed to 
antibiotics as it travels through the gut’. 

We are aware, however, that rational design 
alone is unlikely to be a valid strategy for syn- 
thetic biology. Instead, we see it as an over- 
arching approach that will use many tools, 
including knowledge of physical principles 
and natural biological systems, computer 
simulations and more. The synthetic biolo- 
gist’s toolbox will probably contain some level 
of directed evolution or random variation, but 
such strategies are often limited by through- 
put: sometimes animal testing or clinical trials, 
which can be inherently low throughput, are 
the most interesting tests of the outcomes. 

Ideally, in designing a biological system, 
synthetic biologists would understand (and 
apply) all of the phenomena that nature 
manipulates. This would require a detailed 
mastery of gene expression, protein structure, 
enzyme kinetics and so on. But although many 
of these are well understood, the data are gen- 
erated by specialists who often do not interact 
with each other and who frequently do not 
speak a language that can be understood by 
bioengineers. 

Moreover, most of the scientific literature 
has been written with the goal of making 
natural phenomena less mysterious, rather 
than providing guidance for a bioengineer. 


BEYOND DIVISIONS 


The future of synthetic biology 
nature.com/synbio 
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Practical facts and principles are diffusely 
scattered, and buried in an excess of docu- 
mentation. One immediate challenge is to 
consolidate these data into information that 
engineers can use. 

The past 50 years has seen an explosion of 
biological knowledge, funded mainly at pub- 
lic expense. People might assume that such 
knowledge is wasted if it cannot be used to 
rationally design biology for real-world appli- 
cations. Now is the time to make good on the 
public’s investment. 


Pamela A. Silver and Jeffrey C. Way are 

at Harvard Medical School and the Wyss 
Institute for Biologically Inspired Engineering, 
Boston, Massachusetts 02115, USA. 

e-mails: pamela_silver@hms.harvard.edu; 
jeff. way@wyss.harvard.edu 


Evolving with 
purpose 


FRANCES H. ARNOLD 
& JOSEPH T. MEYEROWITZ 


(Caries are great books of instructions 
on ‘how to’: how to extract materials and 
energy and convert them into self-repairing, 
self-reproducing machines; how to function in 
an extraordinary range of environments; even 
how to adapt over the long term. Evolution 
has accumulated a vast library of instructions. 
We argue that it is also a great tool for writing 
new ones. The results of directed evolution are 
rational and predictable, even if the underlying 
genetic changes follow an uncharted course. 
How good are synthetic biologists at writ- 
ing useful biological instructions? We are 
getting better at synthesizing long stretches 
of DNA that can be inserted into a variety of 
organisms in which they will be read and — 
sometimes — acted on. But most of our writ- 
ing merely rearranges passages lifted from the 
few genomes that we have read. We are pla- 
giarists, poor crafters of original literature. We 
cannot yet create an enzyme or a biosynthetic 
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Figure 1 | Two methods for the synthetic engineering of organisms. a, In rational design, biological 
‘parts’ (such as genes, gene regulators and proteins) are studied through modelling and testing such that their 
behaviour is understood. This produces libraries of well-characterized parts — or modules consisting of 
multiple parts — that can be rationally assembled in cells in different combinations, resulting in predictable 
functions. b, An alternative approach is to harness the power of evolution to direct the design of synthetic 
organisms. A region of interest (in this example, a gene) is randomly mutated, producing cells that harbour 
a variety of mutations. The cells are then screened to find those with mutations that result in or improve a 
desired function. Cells that do not harbour such mutations are eliminated from the pool; those that do can 
be cultured and then subjected to new rounds of mutation and screening, to achieve a desired end product. 


pathway that compares favourably with 
nature's engineering outputs. Nature's writing 
is intricate (some say convoluted and opaque), 
but it is effective. We are just learning to hold 
the pencil. 

The reason is simple: in biology, details 
matter a lot, and we don’t understand the 
details. Rational design is hard when one 
cannot even predict the effects of a single 
mutation in a single enzyme, or the full impact 
of adding a single gene to the thousands 
already present in an organism. It is fine to 
hope that ‘modular’ biology is possible, but 
our ham-fisted attempts at assembling bio- 
logical components usually show that biology 
is anything but modular. The engineering of 
systems that tackle real-world problems, such 
as producing an antimalarial drug or an alter- 
native fuel through a new microbial pathway, 
involves years of trial and error’. Realizing the 
potential of synthetic biology requires dealing 
with the details. 

Rational design will not move beyond these 
problems until our understanding of the 
details of biology has improved dramatically. 
Luckily, we do not have to wait. Evolution is 
a time-tested tool for engineering the details, 
and we can use it in the lab to circumvent our 
profound ignorance of how sequence encodes 
function. Accumulating beneficial muta- 
tions over multiple generations is a general 
algorithm for turning a poor copy-and-paste 
job into effective writing, and it works at all 
levels of complexity. From creating a DNA- 
editing enzyme that excises HIV from the 
host genome’, to optimizing multi-enzyme 
biosynthetic pathways’, directed evolution 


has produced results with purpose’. 
Evolutionary engineering is not incompat- 
ible with ‘rational’ design; in fact, the two are 
highly complementary. Directed evolution 
requires a starting design, and the better the 
design, the easier the evolution. Evolution 
also requires a rational search strategy. Where 
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should mutations be targeted, and how many 
of them? How do we measure success along the 
way? Directed evolution and rational design 
are even claiming common ground, for exam- 
ple in mutant libraries that have been designed 
with input from computational processes, 
accelerating the evolutionary process’. 

With a good starting point in hand, evolu- 
tion is the most direct approach to engineering 
the biological world, and it is uniquely effective 
with biological substrates, the products of that 
same process. Biology is highly evolvable, and 
we should exploit that feature to the fullest, as 
we have for thousands of years with everything 
from rats to racehorses. The writer's best friend 
is a good editor — the synthetic biologist’s 
should be directed evolution. m 


Frances H. Arnold and Joseph T. Meyerowitz 
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The origin of regional 
Arctic warming 


Observational data and modelling show that the rapid warming of the northeastern 
Canada and Greenland sector of the Arctic over the past three decades has been 
strongly driven by cooling in the tropical Pacific Ocean. SEE LETTER P.209 


JURGEN BADER 


ver the past 30 years, Earth has become 

a warmer place. One of the most strik- 

ing examples of surface-temperature 
warming is the polar regions in the North- 
ern Hemisphere (Fig. 1). The greater warm- 
ing in the Arctic’, compared with the global 
mean, is associated with a reduction in sea 
ice? and dynamical and radiative feedbacks’, 
and is widely attributed to anthropogenic cli- 
mate change. But the fact that the warming is 
not spatially uniform raises the question of 
whether natural climate variability has a role in 
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driving it and causing regional climate change. 
On page 209 of this issue, Ding et al.* show 
that the most prominent Arctic warming has 
occurred in northeastern Canada and Green- 
land, and that cooling in the tropical Pacific 
Ocean forced half of the warming in these two 
regions. The findings indicate that a substan- 
tial part of regional Arctic climate change is 
therefore a result of natural climate variability. 

Much of the interannual and decadal vari- 
ability in atmospheric climate can be described 
by the evolution of the leading modes of 
climate variability, such as the North Atlan- 
tic Oscillation (NAO). The NAO consists 


8 MAY 2014 | VOL 509 | NATURE | 167 


| RESEARCH | NEWS & VIEWS 


of variations in the difference of 
sea-level atmospheric pressure 
between the Icelandic low-pressure 
system (Icelandic low) and the 
Azores high-pressure system (Azores 
high), and is most pronounced dur- 
ing boreal winter. In the positive 
phase of the NAO, there is a consid- 
erable difference in pressure between 
these two systems, with both the Ice- 
landic low and the Azores high being 
intensified. In the negative phase, the 
two pressure zones are weakened and 
the difference between them is less. 

The NAO is linked to changes in 
the intensity and location of the North 
Atlantic jet stream and storm track, 
and to large-scale temperature and 
precipitation variations over Europe, 
Greenland and North America. It is 
an intrinsic atmospheric phenom- 
enon, but fluctuations in sea surface 
temperature (SST) can also affect it. In 
fact, it can be influenced by both vari- 
ations in local North Atlantic SST and 
remote SST in the tropics’. Changes in 
tropical SSTs lead to changes in con- 
vection throughout the lowest portion 
of the atmosphere (the troposphere) 
at low latitudes, which in turn excite large-scale 
atmospheric waves called Rossby waves. These 
waves can propagate to mid- and high latitudes 
and affect the NAO. 

In their study, Ding et al. demonstrate that 
Rossby waves and the NAO are involved in 
regional Arctic warming. Their finding that 
the Arctic warming in northeastern Canada 
and Greenland since 1979 is strongly driven 
by cooling in the tropical Pacific is supported 
by observational data indicating that warming 
in these two regions is not limited to the sur- 
face but also extends to the upper troposphere. 
The authors argue that it is unlikely that dec- 
adal temperature changes in the upper Arctic 
troposphere are locally forced by variations 
in surface temperature. They suggest instead 
that warming at the surface and in the tropo- 
sphere are the result of atmospheric-circula- 
tion changes in the high troposphere, and that 
these changes are remotely forced. Specifically, 
Ding et al. show that the recent warming in 
northeastern Canada and Greenland is asso- 
ciated with a negative NAO phase driven by 
Rossby-wave activity caused by SST cooling 
in the tropical Pacific. 

These results are confirmed by modelling 
experiments. The authors demonstrate that 
an atmospheric general circulation model 
forced by the observed tropical SST can simu- 
late the connection between tropical Pacific 
SST cooling and regional Arctic tropospheric 
warming. However, they also show that cou- 
pled ocean-atmosphere climate models used 
in the fifth assessment report of the Intergov- 
ernmental Panel on Climate Change fail to 
reproduce the observed regional pattern of 
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Figure 1 | Trend in annual mean surface temperature. The graphic 
shows the observed change per decade of annual mean surface and 
near-surface temperature for the period 1979-2012, based on the 
ERA-interim climate data set. The most marked warming has occurred 
in northeastern Canada, Greenland and north Siberia. (Adapted from 
Extended Data Fig. 1 of ref. 4.) 


Arctic warming. Two plausible reasons for this 
failure are worth mentioning. First, the recent 
cooling in the tropical Pacific can probably be 
attributed to intrinsic variability of the climate 
system’, because it is not simulated by coupled 
climate simulations that incorporate observed 
changes in the concentration of greenhouse 
gases and aerosols. Second, despite continued 
improvements to climate models, it is still a 
challenge to simulate the influence of remote 


SYNTHETIC BIOLOGY 


climatic phenomena correctly. 

By linking cooling in the tropical 
Pacific with trends in atmospheric cir- 
culation and regional Arctic warming, 
Ding and colleagues highlight the com- 
plexity of processes involved in regional 
climate change. Even remote climatic 
fluctuations can have a substantial 
impact. Improving the representa- 
tion of such teleconnections in cli- 
mate models should therefore remain 
a high priority for climate scientists. 
The authors also note the importance 
of natural internal climate variability 
for present and near-future regional 
Arctic climate. But as greenhouse-gas 
concentrations are likely to increase in 
the future, it is only a question of time 
before external forcing dominates 
regional Arctic warming. m 
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Construction of 
a yeast chromosome 


One aim of synthetic biology is to generate complex synthetic organisms. 
Now, a stage in this process has been achieved in yeast cells — an entire yeast 
chromosome has been converted to a synthetic sequence in a stepwise manner. 


DANIEL G. GIBSON & J. CRAIG VENTER 


biological cell is much like a computer 
— the genome can be thought of as 
the software that encodes the cell’s 
instructions, and the cellular machinery as 
the hardware that interprets and runs the 
software. Advances in DNA technology 
have made it possible for scientists to act as 
biological ‘software engineers, program- 
ming new biological ‘operating systems’ into 
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cells. Indeed, in 2010, the entire genome of 
the bacterium Mycoplasma mycoides was 
replaced with a rewritten synthetic genome, 
generating the first synthetic cell’. Now, ina 
paper published in Science, Annaluru et al. 
describe how they have begun rewriting the 
genome of a more complex organism, that 
of the yeast species Saccharomyces cerevisiae. 
The researchers report the design and gen- 
eration of a functional, synthetic chromo- 
some in this yeast, a milestone that they have 


had in their sights for almost ten years. 

A quote by the theoretical physicist Richard 
Feynman, “What I cannot create, I do not 
understand’, has inspired synthetic biologists 
around the world. But this does not necessarily 
mean that what can be created is understood. 
There is an enormous gap between our abil- 
ity to build DNA and our understanding of 
the instructions it encodes. The production 
of a synthetic chromosome in yeast (the first 
such achievement in a eukaryote — the class 
of organisms comprising plants, animals and 
fungi) represents a step towards closing that 
gap. Generation of synthetic versions of widely 
used model organisms such as S. cerevisiae will 
enable scientists to investigate the require- 
ments of life in eukaryotes, because synthetic 
organisms can be easily manipulated. 

The process began at the computer, where 
Annaluru and colleagues downloaded the 
publicly available DNA sequence ofa S. cere- 
visiae chromosome (chromosome III). Next, 
they designed genetic changes, which can be 
thought of as software edits, with the aim of 
introducing specific alterations into the chro- 
mosome. These edits included the deletion 
of dispensable DNA sequences; the incor- 
poration of unique sequences to enable the 
researchers to differentiate between natural 
and engineered DNA; and the replacement of 
one particular region of DNA that terminates 
gene transcription with another that performs 
the same task. 

In addition, the authors flanked each non- 
essential gene with DNA sequences designed 
to cause deletion of the flanked genes on a 
given signal. This flanking allows the size of the 
chromosome to be reduced, a feature designed 
to help determine the smallest cohorts of genes 
required to perform a given function or nec- 
essary for survival under a particular growth 
condition. This information is crucial if we are 
to write biological software in a more predict- 
able fashion, thus ensuring that cells can be 
engineered to reliably carry out the tasks they 
have been programmed to perform. 

To generate the synthetic chromosome, 
Annaluru and co-workers broke down their 
designer DNA sequence into overlapping 
stretches of 70 nucleotides, which were chem- 
ically synthesized in parallel. Students in the 
Build-a-Genome course at Johns Hopkins 
University in Baltimore, Maryland, stitched 
together these DNA stretches into constructs 
approximately 3 kilobases long, using estab- 
lished DNA-assembly methods**. 

Next, the original chromosome sequence 
was systematically replaced with the synthetic 
DNA in vivo, by introducing up to 12 overlap- 
ping 3-kb synthetic fragments at a time into the 
yeast cell, in 11 successive rounds of integration. 


BEYOND DIVISIONS 


The future of synthetic biology 
nature.com/synbio 
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Figure 1 | Building synthetic genomes. a, To generate a synthetic bacterial cell, the entire synthetic 
bacterial genome is assembled in a yeast cell. In this setting, the bacterial genome is inactive because the 
yeast cell lacks the proteins required to turn on bacterial genes. After synthesis is complete, the synthetic 
genome is transferred to a compatible bacterial host cell, where it is activated and produces a synthetic cell. 
b, By contrast, Annaluru et al.” have generated a synthetic yeast chromosome within the yeast cell itself. To 
do this, they replaced the natural chromosome with chunks of synthetic DNA in a stepwise manner. 


The product of this work is a yeast strain that 
contains an extensively engineered chromo- 
some III, and that grows just as well as the orig- 
inal strain. The 273-kb designer chromosome 
contains more than 50,000 sequence alterations, 
and is 14% shorter than the natural sequence. 

This work is important because it begins 
to address unanswered questions about how 
genome design can be used to manipulate the 
rules of biology in a eukaryotic model. For 
example, can superfluous DNA between genes 
be removed? Can unnecessary genetic code be 
altered? So far, the sequence alterations made 
by Annaluru and colleagues have not reduced 
the fitness of the yeast, which bodes well for 
future modifications. 

If two or more genes perform a similar func- 
tion, can one be deleted? Which combinations 
of the 5,000 yeast genes that are thought to be 
dispensable can be simultaneously removed? 
The authors have already begun cataloguing 
genes that can be simultaneously deleted with- 
out adversely affecting the fitness of the yeast. 
Answering these fundamental questions is a 
goal that the synthetic-yeast group shares with 
efforts to recode the genome of the bacterium 
Escherichia coli’, and with current research 
programmes" that aim to understand better 
how to build bacterial genomes containing 
only the genes necessary to sustain life. 

Two bacterial genomes have previously 
been chemically synthesized — the 583-kb 
genome of Mycoplasma genitalium’ and the 
1,078-kb M. mycoides genome’ — and so 
synthesis of a chromosome of 273 kb is not in 
itself unusual. The genomes of both M. geni- 
talium and M. mycoides were assembled and 
propagated in yeast, acting as an extra chromo- 
some. Like incompatible Macintosh software 
in a Windows computer, synthetic bacterial 
genomes are unable to ‘boot up’ in yeast- 
cell machinery, and so cannot produce self- 
replicating bacterial cells in this setting. In 
a process called genome transplantation”, 
the complete bacterial genome must be 
moved to a bacterial host with compatible 
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hardware, converting the recipient cell into a 
new, synthetic species (Fig. 1a). By contrast, 
Annaluru et al. gradually converted the natu- 
ral yeast chromosome into a fully functional 
designer chromosome within their target 
species, the yeast cell itself (Fig. 1b). 

In comparison to the completely synthetic 
M. mycoides cell’, chromosome III accounts 
for less than 3% of the yeast genome. Now the 
question is: can these design rules be success- 
fully applied across the entire yeast genome? 
The scientists attempting to generate syn- 
thetic yeast cells still have a long way to go 
before they make a fully reprogrammed yeast 
genome. However, by demonstrating the suc- 
cess of their design principles and assembling 
an international team of scientists to build 
the remaining 15 chromosomes, Annaluru 
and colleagues have laid the groundwork for 
making this happen in the near future. 

Advances such as these are of interest to the 
entire field of synthetic biology. Each innova- 
tion not only enhances our general understand- 
ing of biology, but also creates a framework on 
which we can build and expand, allowing us to 
work towards the goal of basing our economy 
on synthetic biology — a development that 
will have a positive impact on all of society. m 
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A virtual Universe 


A numerical simulation of cosmic structure formation reproduces both large- and 
smaller-scale features of a representative volume of the Universe from early in its 
history to the present day. SEE ARTICLE P.177 


MICHAEL BOYLAN-KOLCHIN 


cosmology is that a model with only six 
parameters can explain the vast majority 
of observational data from the first minutes 
of the Universe to the present day’. This stand- 
ard model posits that 95% of the Universe 
today is composed of enigmatic ‘dark matter 
and ‘dark energy. Paradoxically, modelling 
the dynamics of the remaining 5% — normal, 
‘baryonic’ matter — has proved to be the more 
challenging task. On page 177 of this issue, 
Vogelsberger et al.” describe a numerical sim- 
ulation of the formation of cosmic structure 
that captures both the large-scale distribution 
of baryonic material and its properties in indi- 
vidual galactic systems through cosmic time. 
Tracking the evolution of baryonic matter 
is a daunting undertaking because of the huge 
range of physical scales involved in the pro- 
cesses that shape galaxies and larger structures 
(Fig. 1). To cover a representative portion of 
the Universe, cosmologists must study cosmic 
volumes that are at least 100 million parsecs 
(326 million light years) across. By contrast, 
the natural scale of star formation is approxi- 
mately 1 parsec, and accretion of gas by super- 
massive black holes occurs on even smaller 
scales. Numerical simulations have long been 
the tool of choice for tackling these problems. 
But even with the most powerful supercom- 
puters, it has been impossible to run a simu- 
lation large enough to model the large-scale 


Pp erhaps the greatest triumph of modern 


distribution of gas, stars and dark matter while 
keeping sufficient detail to accurately capture 
individual galaxies. It turns out that simulating 
the Universe can be a difficult endeavour. 

Vogelsberger and colleagues attack this 
problem from all sides. Their simulation — 
named Illustris — incorporates more than 
10 billion individual cells to represent the gas in 
the simulation volume; this is nearly a tenfold 
increase on its predecessors. The numerical 
code’ used to perform their simulation employs 
a new approach, based on an unstructured 
and adaptive computational mesh that follows 
fluid flows, to solve the equations that describe 
the time evolution of baryonic matter within 
cosmic structures. And finally, the physical 
phenomena included in the simulation are rich 
and complex: the authors consider gas cooling, 
stellar evolution, energy input from supernova 
explosions, production of chemical elements, 
and gas accretion by supermassive black holes 
(along with accompanying radiative feedback), 
to list just a few. 

If this all sounds somewhat complicated, 
do not be fooled: it is extremely complicated. 
Many of these processes are not understood 
from first principles, and they interact in com- 
plex, nonlinear ways. Additionally, the relevant 
physical scales are often (much) smaller than 
can be directly resolved, even with Illustris. 
This requires computationally efficient mod- 
els that accurately encapsulate the underlying 
physics. Running the simulation was therefore 
no mean feat: it took approximately 16 million 


CPU (central processing unit) hours. The end 
result, however, is a simulated Universe that 
looks an awful lot like the real one. 

A mock observation of Illustris set to mimic 
the Hubble Ultra Deep Field’, the deepest pic- 
ture of the cosmos ever taken, can easily pass for 
the real thing when the two are viewed side by 
side (see Fig. 1b, c of the paper’). Images of gal- 
axies from the simulation are also impressively 
realistic (see Fig. 1a of the paper), an accom- 
plishment that has previously been possible 
only for simulations of individual galaxies. This 
is not just window dressing: such procedures 
allow direct and meaningful confrontation 
of theory with data. A wide array of quantita- 
tive measures agrees with observations of the 
real Universe as well. For example, previous 
generations of simulations had great difficulty 
capturing the observed distribution of elements 
heavier than hydrogen and helium contained 
in stars. Illustris reproduces these observations, 
not just for the Universe as a whole, but also as 
a function of galactic stellar mass. In addition, 
the simulation matches the abundance of these 
heavier elements in dense gas clouds. 

Of course, Illustris does not mark the end 
of cosmological simulations of galaxy forma- 
tion. Although its computational volume is 
immense, it is not large enough to model the 
formation of rare cosmological objects (for 
example, powerful black holes observed in the 
early Universe). And the level of detail is still 
not fine enough to study the faintest galaxies 
surrounding the Milky Way. Star formation in 
low-mass galaxies occurs earlier and faster in 
Illustris than in the real Universe, a difficulty 
shared by almost all models of galaxy forma- 
tion’. However, such issues point the way for 
future advances. 

One clear goal for observers and theorists 
alike is to understand in detail the ways in 
which energy and momentum from evolving 
and exploding stars affect the properties of gas 
in and around galaxies*’. A promising compu- 
tational approach is to combine large-volume 


Figure 1 | The multi-scale nature of galaxy formation. Studying the 
formation of galaxies such as the spiral galaxy Messier 81 (left inset) requires 
simulating their large-scale environments (background image) as well as 
understanding the birthplaces of individual stars (right inset). The scale of the 
background image is approximately 150 million parsecs, and each inset image 
zooms in by a factor of approximately 10,000. It is this huge range in scales that 
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makes ab initio modelling of galaxy formation such a challenge. Vogelsberger 
et al.” were able to simulate a representative volume of the Universe while 

still resolving details for individual galaxies. (Background image: M. Boylan- 
Kolchin et al./Max Planck Inst. Astrophys.; left inset: NASA/JPL-Caltech/ESA/ 
Harvard-Smithsonian CfA; right inset: NASA, N. Walborn & J. Maiz-Apellaniz 
(STSI), R. Barba (La Plata Observatory).) 


simulations such as Illustris, which are necessarily 
limited in the level of detail they can resolve, 
with simulations that focus all of their power 
on individual galaxies, which sacrifice the ability 
to study galaxy formation in a statistical manner. 
If the knowledge gained from these focused 
simulations can be incorporated into large- 
scale computational efforts, ever more accurate 
numerical realizations of the Universe should 
be possible, guiding our investigations into the 
underlying physical processes. 
Computationally, the ability to reach the 


BIODIVERSITY 


scales necessary to model star formation directly, 
while encompassing thousands of galaxies 
similar to the Milky Way, is still a distant dream. 
But thanks to the Ilustris simulation, creating a 
realistic virtual Universe of gas, stars, black holes 
and dark matter is already a reality. m 
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Supply and demand 


Data from Himalayan songbirds suggest that the rate-limiting step in biodiversity 
production may not be the speed of speciation, but rather the speed at which new 


niches are created. SEE LETTER P.222 


ARNE 0. MOOERS 


r There are currently about 10,000 named 
bird species on the planet, but only half 
as many mammals. What dictates the 

number of species in any given group? In this 

issue, Price et al.’ (page 222) marry the ecolo- 
gies of bird species living in close proximity in 
northern India with their ages to argue that it 
is not the rate at which new species form but 
the rate at which new niches are created that 
limits how many species actually persist. By 
building a new evolutionary tree linking all 

358 songbird species inhabiting a Himalayan 

slope, the authors find that it has taken an aver- 

age of 7 million years for related species to end 
up together on the landscape. 

This seems a long time: bird speciation 
generally involves geographic separation and, 
from beginning to end, it usually takes less than 
3 million years for species-level mutual disdain 
to evolve; new species can populate adjacent 
landscapes and join their close relatives in the 
blink ofa geological era. That said, species that 
occupy the same niche rarely occur together in 
nature. So Price and colleagues argue that the 
roughly 4 million extra years it seems to take 
for species to establish themselves alongside 
their relatives means they have had to wait that 
long for available niches to arise. This suggests 
that finding a new role in nature may be the 
limiting step in diversification. 

Given the ambiguous signals that evolu- 
tionary trees have offered us so far, using a 
partial evolutionary tree linking coexisting 
species to make an argument about large- 
scale processes is audacious. Previous studies 
that have compared the lengths of recent and 
older branches on evolutionary trees to meas- 
ure diversification’ generally have found that 
more-recent branches are too long relative to 


expectations from simple evolutionary models. 
Such data often suggest both that extinction 
is non-existent (which, of course, is not true) 
and that diversification has slowed towards the 
present’. 

There are at least three reasons for this odd 
pattern of slowing diversification, and Price 
et al. consider them all. Mundanely, inferring 
trees is hard, and recent branches on evo- 
lutionary trees may be biased to be too long 
relative to older branches*. Luckily, the authors 
could compare their tree to a recent and inde- 
pendently derived one that includes nearly all 
the same species”. Crucially, although the two 
trees differ greatly in total age and shape, they 
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agree almost exactly on the 7-million-year 
time lag for the coexistence of relatives. 

More interestingly, an apparent slowdown in 
diversification is expected because speciation 
takes time: we miss more-recent diversification 
events because we do not recognize separated 
populations that will eventually become new 
species as new species’. This means that recent 
speciation events have not yet been recorded, 
so recent branches will be longer than they 
should be. Many evolutionary trees are well 
described by such a process’. However, Price 
and colleagues sidestep this issue with a par- 
ticularly nimble piece of logic. Because most 
speciation takes place when populations are 
separated geographically, and given that line- 
ages that occupy the same niche rarely coexist, 
there can be few or no unrecognized species 
in a sample of species living in proximity. 
Nascent species may exist elsewhere, but the 
recent branches of these authors’ tree are 
certainly not too long because of a species- 
recognition problem. 

Most intriguing is to consider the possibility 
that slowdown is acommon fate in diversifica- 
tion. Following the invasion of relatively empty 


High elevation 


Figure 1 | Himalayan songbird assemblage. Although bird species generally take less than 3 million 
years to form, Price et al.' show that closely related species (those joined by the recent (red) branches of 
the evolutionary tree) that are found together in the landscape are separated by about 7 million years — 
these recent branches are much longer than most of the deeper (blue) branches. Their tree also reveals 
that closely related bird species tend to occupy different elevations; there are more species at mid- 
elevations where there is more food; and variation in songbird size and shape is highest at high elevations, 
which are species-poor and therefore competition-poor. These patterns are all consistent with a model in 
which the ultimate brake on diversification is ecological opportunity. 
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niche space — by dispersal to a new region 
or by the evolution of new ways to make a 
living — an initial flurry of speciation fills up an 
initial set of niches, but then new species 
become established only when others go 
extinct or new niches are created’. 

Such an ecological-demand model (in 
contrast to evolutionary supply) also fits 
many evolutionary trees well’, and Price and 
colleagues’ data on songbirds tally with this 
story (Fig. 1). Other data from the region 
suggest that recent ecological opportunity 
may be linked to changes in elevation, and 
Price et al. indeed find that close relatives 
differ most often in the elevation at which 
they live. The well-known pattern of highest 
species richness at mid-elevations’ is, in their 
data, associated with more food (and so more 
niches) rather than with more speciation. And 
songbirds span a greater range of sizes and 
shapes at species-poor high elevations, per- 
haps owing to release from competition with 
other groups of birds that could not establish 
themselves. 

There is little question that an ecological- 
demand model of biodiversity offers up a 
grand narrative: nascent species arise in 
ample supply as isolated populations, but 
these expand their ranges and fill landscapes 
only if, and only when, those landscapes have 
ecological room for them. Many new species 
might therefore wither on the vine and never 
reach this expansion phase. This model implies 
a decoupling of the rate of speciation from 
the rate of subsequent establishment, a novel 
pattern that has recently been documented for 
flies and birds”. 

An ecological-demand model also raises 
several questions. Evolutionary biologists have 
long appreciated that close relatives compete 
strongly''. However, we have only recently re- 
appreciated that speciation is often intimately 
linked to diverging ecologies’*. Therefore, to 
evaluate the ecological-demand model, we 
need to know the relative contributions of 
ecological’ and non-ecological speciation” to 
biodiversity production. And, given that the 
model suggests that the number of species in 
a group is related to niche availability’, we 
need to understand both why there are more 
niches for some groups (say, birds) than for 
others (say, mammals), and whether it is really 
true that, at any one time, many landscapes are 
full to the brim with biodiversity. m 
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Diversity in the afterlife 


Field experiments that varied the composition of both plant litter and the 
organisms that break it down have revealed that, across ecosystems, 
lower biodiversity slows the rate of litter decomposition. SEE LETTER P.218 


JENNIE R. MCLAREN 


espite our increased awareness of 
Distress loss, and attempts to 

respond to it, the global rate of species 
extinction does not seem to be slowing’. Con- 
cern about how these losses may affect the way 
that ecosystems function has led to decades of 
experiments looking for connections between 
the properties of ecosystems and the numbers 
and types of species that live in them. Using 
field experiments and meta-analyses, ecolo- 
gists have determined that decreased diver- 
sity reduces the efficiency of resource capture 
by communities and the conversion of those 
resources to biomass’. Most biomass created 
by plants is not consumed by herbivores, but 
becomes ‘litter’ after a plant dies. Do the effects 
of diversity die with the plants? On page 218 
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of this issue, Handa et al.’ suggest that the 
answer is no, by showing that both the diversity 
of dead plants and the diversity of organisms 
that decompose them contribute to determin- 
ing how quickly this material is recycled back 
into ecosystems. 

Although experiments and meta-analyses 
have given us confidence in our predictions 
of the effects of mixing together living plants, 
experiments in which different litter types are 
mixed rarely show predictable results**. Meta- 
analyses of these studies have been limited by 
different experimental designs, site-specific 
effects and the potential influence of the type 
rather than the number of species. Handa and 
collaborators attacked this problem using a 
series of coordinated field experiments, at sites 
ranging from the sub-Arctic to the tropics, 
in which they manipulated the diversity of both 
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Increasing rate of litter breakdown 


Figure 1 | Composition and decomposition. Handa et al.’ show that increasing the diversity of either 
plant litter or detritivores — invertebrates and microorganisms that break down litter — independently 
increases rates of litter breakdown. Simultaneously increasing both litter and detritivore diversity results 


in the largest increases in decomposition rate. 
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plants and detritivores — the invertebrates 
and microorganisms that break down litter. 
They found that a reduction in the diver- 
sity of either slows the rate at which litter is 
decomposed, regardless of the location of the 
experiment (Fig. 1). 

Decomposition rates are an important 
determinant of the global carbon budget, 
affecting not only the incorporation of vegeta- 
tion carbon into soils, but also the early stages 
of its release back to the atmosphere. That 
Handa et al. found effects of diversity on this 
process across biomes and ecosystems suggests 
a remarkable consistency in the consequences 
of biodiversity loss. 

The authors manipulated detritivore diver- 
sity by varying the types of container the litter 
was decomposed in, and thus the organisms 
that could access and break down the litter. 
They show that the diversity of these organ- 
isms had even more influence than the com- 
position of the litter community itself — the 
more types of detritivore, the faster the litter 
was decomposed. This finding provides exper- 
imental support for the results of a meta-analy- 
sis° that concluded that the effects of consumer 
diversity on decomposition are generally more 
important than the effects of resource diversity. 

Handa and colleagues’ study is also one of the 
first to provide evidence for a potential mecha- 
nism driving the effects of higher litter diversity: 
the movement of nitrogen between different 
litter types. Laboratory studies had previously 
shown that such nitrogen movement was 
possible’, but whether it could occur in natu- 
ral settings, and whether it would lead to 
changes in decomposition, was unknown 
until now. 

Although nitrogen movement between 
leaves had been predicted to follow a simple 
gradient — from leaves with more nitrogen to 
leaves with less — it may instead be driven by 
carbon quality. If microbes grow quickly on 
litter from plants with high-quality carbon, 
this would increase the microbes’ need for 
nitrogen, and when it is not available locally 
they may access it from the neighbouring 
litter. Handa et al. found evidence for nitrogen 
movement between litter types across biomes 
and ecosystems, and always between the same 
two litter types (from nitrogen-fixing litter 
species to rapidly decomposing ones). This is 
an exciting result that will increase our abil- 
ity to predict the effects of biodiversity loss on 
carbon cycling and storage. 

However, there is still no direct evidence 
for nitrogen translocation — in the latest 
experiments, it was inferred from patterns 
in nutrient loss or gain as the litter decom- 
posed. The researchers report that when the 
nitrogen-fixing and fast-decomposing litter 
species are decomposed together, the amount 
of nitrogen present in one goes up while the 
amount in the other goes down, which is 
best explained by nitrogen moving between 
species. But it is possible that the nitrogen 
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moving into the litter is coming from else- 
where, including the surrounding soil. Given 
that the predictive power of Handa and col- 
leagues’ results hinges on knowledge about 
underlying mechanisms, a key next step will 
be to confirm the implied mechanism with 
direct tests in the field. One possibility would 
be to perform experiments using litter with 
radiolabelled nitrogen in the field, which have 
so far been done only in the lab. 

We are increasingly aware that understand- 
ing the impacts of biodiversity loss requires 
recognizing that not all species are at equal risk 
of extinction®. The identity of species lost has 
been shown to have large effects on plant pro- 
ductivity” and decomposition’, which results 
in a range of potential outcomes for differ- 
ent extinction scenarios. Thus, being able to 
accurately predict the effects of species loss 
at a broader scale will require both generaliz- 
able rules linking certain types of species to 
the effects of their loss, and knowledge of the 
mechanisms that cause those effects. 

Handa and colleagues’ study brings us closer 
to that goal by providing three things: general 
patterns linking litter and detritivore diversity 


STRUCTURAL BIOLOGY 


to decomposition; evidence for a mechanism 
driving that link; and evidence that these 
patterns and mechanism are the same regard- 
less of whether they occur in an Arctic stream 
or a forest in the tropics. With this informa- 
tion in hand, we can strive towards linking 
patterns to realistic extinction scenarios, 
and thus predict probable outcomes of 
biodiversity loss. m 
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Action at a distance 
in a light receptor 


A tour de force of X-ray scattering has yielded structures of a phytochrome 
photoreceptor in its dark and illuminated states, showing how localized protein 
refolding magnifies a light signal to form a cellular message. SEE LETTER P.245 


ANNA W. BAKER & KATRINA T. FOREST 


ensor proteins allow organisms to perceive 

and appropriately respond to environ- 

mental changes. They are molecular 
machines comprised of sensory domains and 
effector modules; a specific signal is received 
by the former and communicated to the latter 
through structural changes, creating an out- 
put activity that directs a cellular response. 
Phytochromes are dimeric sensor proteins 
that specifically absorb red and near-infrared 
light using a covalently tethered chromophore 
molecule housed in a light-sensitive (photo- 
sensory) core. In this issue, Takala et al.! 
(page 245) demonstrate how visible light is 
interpreted and spatially magnified by the 
phytochrome dimer in a chain reaction that 
uses key features of the protein’s three-dimen- 
sional structure. 

Phytochromes regulate most of the responses 
of plants to light, including germination, 
flowering and shade avoidance, and are thus 
fundamental to agriculture’. They are also 
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widespread in microbes’, and phytochromes 
found in cyanobacteria are thought to be ances- 
tors of those in plants”. The proteins signal by 
switching between two stable conformations, 
which correspond to the dark and illuminated 
states. Absorption of light by the chromophore 
alters a local network of hydrogen bonds and 
van der Waals interactions, and ultimately 
induces the illuminated conformation. 

How these minute changes (of the order of 
angstroms) are transferred through the pro- 
tein to regulate the distant effector domain is 
a long-standing puzzle. One proposed model, 
also applicable to blue-light receptors and 
sensors that react to chemical signals, involves 
the rotation of central helices connecting the 
sensory and effector modules*. In phyto- 
chromes, a structural element known as the 
tongue also spans the two modules and is 
likewise expected to be involved in signal 
transduction’. 

The inherent dynamism of the dark and 
illuminated states confounds structural studies 
of phytochromes. Several dark-state crystal 
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Figure 1 | Signal magnification in phytochrome light sensors. a, Light-induced isomerization of 

the chromophore in phytochromes causes atomic-scale structural changes within picoseconds of 
illumination. b, Takala et al.' report that these changes lead to rearrangement of a crucial secondary- 
structure element in the ‘tongue’ region of the phytochrome: a B-hairpin unravels and an a-helix forms. 

c, Within 30 milliseconds, the closed quaternary structure adopted in the dark state opens to a Y-shaped 
conformation in the illuminated state, with effector domains (structure unknown) at the distal tips of the Y. 
The phytochrome’s characteristic figure-of-eight knot may rigidify the chromophore-binding domain, 
thereby restricting light-driven motion to the hinge in the central helix of each dimer subunit. The two 
subunits of the phytochrome are shown in orange and yellow; red squares indicate approximate times and 


length scales of the changes. 


structures of domains of the photosensory 
core have been solved”, but finding a way 
to consistently trap the phytochrome in the 
illuminated state for crystallography has 
been a major obstacle. Takala and colleagues 
overcame this problem by taking advantage 
of the fact that the phytochrome from the 
bacterium Deinococcus radiodurans relaxes 
slowly in vitro from its illuminated state to 
its dark conformation. The authors therefore 
delivered periodic red-light pulses to growing 
crystals of the protein, to enrich the popula- 
tion of molecules in the illuminated state, and 
thereby enable a stable crystal of the sensor in 
that conformation to be grown. This allowed 
low-resolution X-ray structures to be solved 
for the complete photosensory core in the 
dark and illuminated crystalline states. 

Next, the researchers performed molecular 
dynamics simulations based on the crystal 
structures to generate structures of the pro- 
tein in solution. These were validated by 
comparison with X-ray-scattering data for 
the photosensory core in solution. Together, 
the crystal and solution structures support a 
model in which chromophore isomerization 
resulting from red-light absorption ruptures 
adjacent hydrogen bonds, causing a B-hairpin 
structural feature in the tongue to unfold and 
nearby atoms in the same malleable structural 
unit to wind into an a-helix. The tongue thus 
contracts by 2.5 A to relieve a sharp (approxi- 
mately 50°) bend at a hinge in the central helix. 
The resulting illuminated state has a distinct 
Y shape, in which the arms of the Y are as much 
as 8 nanometres apart. In an experimental 


tour de force, the authors combined time- 
resolved X-ray-scattering data collected over 
several timescales in solution, and found that 
the phytochrome magnifies light-induced 
atomic-scale changes (produced within 
picoseconds‘; 1 picosecond is 10°"’s) to the 
nanometre scale in milliseconds (Fig. 1). 

Models of signal transduction in which 
the central helix of the phytochrome rotates 
are not supported by these findings, but are 
supplanted by a scenario in which the helix 
straightens in response to refolding of the 
tongue when illuminated. This explains why 
two tongue conformations have previously 
been seen for phytochromes™. The new model 
is supported by earlier observations’ of light- 
induced effects near the chromophore, and 
plausibly connects structural changes from 
the atomic to the nanometre scale. A central 
tenet of the mechanism is that the secondary 
structure of the tongue changes, but could it be 
that the observed conformations occur only in 
crystals, and not in solution? Such concerns are 
largely allayed by the triangulation of results 
obtained using molecular dynamics, solution 
scattering and protein crystallography. 

The tongue-refolding model requires that 
dramatic movements within the protein are 
manifested through a plastic region in the 
central helix; for this to occur, kinetic energy 
must not be misdirected into random motion 
throughout the photosensory core. Phyto- 
chromes contain an unusual figure-of-eight 
knot’, and we propose that this limits the num- 
ber of degrees of conformational freedom avail- 
able to the protein when an energetic red-light 
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photon is absorbed. This rigidity presumably 
funnels the protein’s response to the appropri- 
ate changes in secondary structure along the 
chromophore-tongue-effector-domain trajec- 
tory so that signal transduction can proceed. 

Takala and co-workers provide a picture of 
phytochrome signal transduction that sug- 
gests a mechanism for how the activity of the 
effector domain is regulated by red light. In 
D. radiodurans, the phytochrome effector 
module is a histidine kinase. In the illumi- 
nated state, the ends of the two Y arms each 
bear one of these enzymatic modules. We 
infer that the trajectory of the central helix 
observed in each monomer of the illumi- 
nated structures continues through the kinase 
domain, although this remains to be proven 
experimentally. In any case, strain induced 
by straightening of the long central helix will 
force the effector modules into a new orienta- 
tion relative to one another. 

Histidine kinases act by transferring a 
phosphate group to their substrate across a 
subunit-subunit interface’®. In the case of the 
phytochrome, it is logical to conclude that 
enzymatic activity occurs in the dark, when the 
active site of an effector module is close to the 
phosphorylation site of its partner, but halts in 
the light, when the modules are separated”. 
Unfortunately, crystallization of dynamic his- 
tidine kinases suffers from similar roadblocks 
to those associated with phytochrome crys- 
tallization. A complete phytochrome struc- 
ture at near-atomic resolution in both states 
therefore remains the ultimate goal in the field, 
along with accompanying enzymological and 
physiological studies to support any structural 
model that shows how local structural changes 
are magnified into global ones. = 
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Properties of galaxies reproduced by a 
hydrodynamic simulation 


M. Vogelsberger', S. Genel’, V. Springel**, P. Torrey’, D. Sijacki?, D. Xu’, G. Snyder®, S. Bird’, D. Nelson? & L. Hernquist” 


Previous simulations of the growth of cosmic structures have broadly reproduced the ‘cosmic web’ of galaxies that we 
see in the Universe, but failed to create a mixed population of elliptical and spiral galaxies, because of numerical in- 
accuracies and incomplete physical models. Moreover, they were unable to track the small-scale evolution of gas and 
stars to the present epoch within a representative portion of the Universe. Here we report a simulation that starts 12 
million years after the Big Bang, and traces 13 billion years of cosmic evolution with 12 billion resolution elements in a 
cube of 106.5 megaparsecs a side. It yields a reasonable population of ellipticals and spirals, reproduces the observed 
distribution of galaxies in clusters and characteristics of hydrogen on large scales, and at the same time matches the 


‘metal’ and hydrogen content of galaxies on small scales. 


The initial conditions for structure formation in the Universe are tightly 
constrained from measurements of anisotropies in the cosmic micro- 
wave background radiation’. However, previous attempts to reproduce 
the properties of the observed cosmological structures with computer 
models have shown only limited success. No single, self-consistent sim- 
ulation of the Universe was able to simultaneously predict statistics on 
large scales, such as the distribution of neutral hydrogen or the galaxy 
population of massive galaxy clusters, together with galaxy properties 
on small scales, such as the morphology and detailed gas and stellar con- 
tent of galaxies. The challenge lies in following the baryonic component 
of the Universe using hydrodynamic simulations” *, which are required 
to model gas, stars, supermassive black holes (SMBHs) and their related 
energetic feedback. The vast computational challenges of these simula- 
tions have forced previous attempts to focus either on simulating only 
small portions of the Universe or to employ a coarse resolution such that 
internal characteristics of galaxies could not be resolved. Furthermore, 
the large dynamic range of length scales in the problem forces common 
hydrodynamic solvers to sacrifice accuracy. Finally, poorly understood 
small-scale processes such as star formation and accretion onto SMBHs 
are coupled to galactic and super-galactic scales, introducing large uncer- 
tainties into modelling techniques. 

Rapid advances in computing power combined with improved nu- 
merical algorithms and more faithful models of the relevant physics 
have allowed us to produce a simulation (named Illustris) that simul- 
taneously follows the evolution of dark matter and baryons in detail. 
Starting approximately 12 million years (Myr) after the Big Bang, our 
simulation tracks the evolution of more than 12 billion resolution ele- 
ments in a volume of (106.5 Mpc)’ up to the current epoch (redshift 
z = 0). This allows us to achieve a dark-matter mass resolution of 6.26 X 
10° times that of the Sun (M<), and a baryonic mass resolution of 
1.26 X 10°M«. The smallest scale over which the hydrodynamics is re- 
solved is 48 pc, whereas gravitational forces are resolved down to 710 pce 
at z = 0 and to even smaller scales at high redshifts (for example, 473 pc 
at z = 2). Our calculation therefore overcomes the problems of previous 
hydrodynamic simulations which either did not cover a large enough 
portion of the Universe to be representative, lacked adequate resolu- 
tion, or failed to reach the present epoch. Apart from having a large 


volume and improved resolution, our simulation is evolved with the novel 
hydrodynamic algorithm AREPO’, which uses a moving unstructured 
Voronoi tessellation in combination with a finite volume approach 
(Methods). Finally, we employ a numerically well-posed and reasonably 
complete model for galaxy formation physics, which includes the for- 
mation of both stars and SMBHs, and their effects on their environments 
in forms of galactic super-winds driven by star formation, as well as 
radio bubbles and radiation proximity effects caused by active galactic 
nuclei (AGNs; see Methods). 

Unlike previous attempts, we find a mix of galaxy morphologies 
ranging from blue spiral galaxies to red ellipticals, with a hydrogen and 
‘metal’ (that is, all elements other than hydrogen and helium) content 
in good agreement with observational data. At the same time, our model 
predicts correctly the large-scale distribution of neutral hydrogen, and 
the radial distribution of satellite galaxies within galaxy clusters. Our 
results therefore demonstrate that the A cold dark matter (ACDM) model 
can correctly describe the variety of observational data on small and 
large scales in our Universe. It also predicts a strong, scale-dependent 
impact of baryonic effects on the dark-matter distribution, at a level 
that has significant implications for future precision probes of cosmology. 


Observing the model Universe 


The simulation volume contains 41,416 galaxies at z = 0 that are re- 
solved with more than 500 stellar resolution elements. Our model yields 
a population of non-star-forming elliptical galaxies, star-forming disk 
galaxies, and irregular galaxies (Fig. 1a). We find that galaxies with low 
star-formation rates contain about 52% of all the stellar mass that is 
in galaxies more massive than M+ = 10°Mo, which agrees well with 
observations® (54%-60%). Simulating the formation of realistic disk 
galaxies, like our own Milky Way, has remained an unsolved problem 
for more than two decades. The culprit was an angular momentum deficit 
leading to too high central concentrations, overly massive bulges and 
unrealistic rotation curves’*. The fact that our calculation naturally pro- 
duces a morphological mix of realistic disk galaxies coexisting with a 
population of ellipticals resolves this long-standing issue. It also shows 
that previous futile attempts to achieve this were not due to an inherent 
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flaw of the ACDM paradigm, but rather due to limitations of numer- 
ical algorithms and physical modelling. 

As our simulation follows the evolution of galaxies starting rela- 
tively shortly after the Big Bang, we can construct virtual mock obser- 
vations that mimic the conditions that the Hubble Space Telescope 
encounters as it images galaxies across cosmic time in very deep surveys, 
such as the Ultra Deep Field (UDF), and the continuing Frontier Fields 
programme. These observations capture a large variety of galaxy lumi- 
nosities, sizes, colours, morphologies and evolutionary stages, provid- 
ing remarkable benchmarks for galaxy formation theories. We have 
constructed a mock UDF and compare it side by side with data from 
the HST eXtreme Deep Field (XDF) compilation” (Fig. 1b, c). Galaxies 
in the mock UDF appear strikingly similar to the observed population 
in terms of number density, colours, sizes and morphologies. Our model 
is the first hydrodynamic simulation from which a faithful deep UDF- 
like observation could be constructed, thanks to its combination of large 
volume and high resolution, along with the new numerical techniques 
that allow it to reproduce realistic galaxy morphologies. 


Satellite galaxies in clusters 

This qualitative agreement also extends to many quantitative probes 
of the distribution and internal structure of galaxies. The abundance 
and spatial distribution of satellite galaxies forms an important obser- 
vational test of the ACDM paradigm, as it is very sensitive both to details 
of the hierarchical structure formation process and to galactic-scale 
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Figure 1 | Mock images of the simulated galaxy population. a, Stellar light 
distributions (g, r, i bands) for a sample of galaxies at z = 0 arranged along the 
classical Hubble sequence for morphological classification. Our simulation 
produces a variety of galaxy types, ranging from ellipticals to disk galaxies to 
irregular systems, the last mostly resulting from interactions and mergers. 

b, HST UDF image (2.8 arcmin on a side) in B, Z and H bands convolved with 
Gaussian point-spread functions of o = 0.04, 0.08 and 0.16 arcsec, respectively. 
c, HST mock observation from IIlustris. 
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baryonic processes. In fact, two of the most acute challenges to the ACDM 
modelare related to satellite galaxies on galactic scales, giving rise to the 
‘missing satellite’ and ‘too-big-to-fail’ problems". A particularly tax- 
ing problem is the radial distribution of satellite galaxies within galaxy 
clusters, which are the largest gravitationally bound objects in the Universe. 

Recently, large Sloan Digital Sky Survey cluster samples have made 
it possible to measure accurate radial satellite profiles’*’’. Theoretical 
models have so far struggled to reproduce the shapes and normaliza- 
tions of these profiles. Semi-analytic models have resorted to ad hoc 
prescriptions for mass-stripping and tuned approximations for satellite 
orbits because they are missing the gravitational effects of the stellar 
component. Despite the freedom offered by the adopted coarse para- 
meterizations, most semi-analytic models consistently find radial pro- 
files that are too steep, and in particular overestimate the number of 
satellites in the inner regions of clusters (radius r < 100-200 kpc)'*"’. 
Previous hydrodynamic simulations, on the other hand, found inner 
radial profiles that are too shallow and in most cases could not repro- 
duce the observed normalization'*“*, owing both to limited resolution 
and to an over-production of stars in satellites due to missing physical 
processes. 

For the most massive haloes, which host the largest accumulations 
of galaxies in the Universe, we have calculated a stacked projected gal- 
axy count profile and compared it to results from a sample’? of satellite 
galaxies with r-band magnitude <—20.5, for host systems at 0.15 =z 
= 0.4 extracted from 55,121 Sloan Digital Sky Survey groups and clus- 
ters (10°°’Ma < Msoo,crit < 10'°°M 5, where Ms00,crit is the character- 
istic mass enclosed in a radius with mean density 500 times the critical 
density for closure) centred on luminous red galaxies (Fig. 2). Having 
higher resolution than previous hydrodynamic simulations, and more 
realistic feedback models that suppress the stellar masses of satellites, 
we obtain a good agreement with both the observed profile shape and 
its normalization, as well as with the mean galaxy colour as a function 
of cluster-centric distance’. The radial distribution of dark-matter sub- 
haloes from a corresponding dark-matter-only simulation (‘Illustris- 
Dark’) flattens towards the centre. This difference is due to dissipational 
processes of galaxy formation that make the stellar component more 
resistant to tidal disruption close to cluster centres. This directly dem- 
onstrates that neglecting baryonic physics causes inaccuracies in the 
spatial distribution of satellite galaxies which in turn can lead to errors 
in estimates of galaxy merger rates. 


Metals and neutral hydrogen in galaxies 


The high mass resolution of the simulation makes it possible to study 
the internal characteristics of galaxies. It allows us, for example, to make 
clear predictions for the stellar and gaseous contents of galaxies, and 
for the baryonic cycle operating between them. Our model predicts the 
present-day HI mass, Myy, of galaxies in the local Universe, which can 
be compared to observations as revealed by the Arecibo Legacy FAST 
ALFA Survey (ALFALFA; Fig. 3a). As stars form out of cold gas which 
is largely neutral, the H1 richness is a good probe of the reservoir of 
gas available for star formation in a galaxy. The H I-selected samples of 
ALFALFA are typically biased towards the most gas-rich star-forming 
galaxies; that is, the major limitation of H 1-selected samples is that they 
will miss the most gas-poor elliptical galaxies. 

Our results recover the trend of decreasing H1 richness with in- 
creasing stellar mass. For the most massive galaxies, the full simulated 
galaxy sample deviates from the observed mean relation. Many of these 
massive galaxies have been quenched through AGN feedback and there- 
fore contain little amounts of gas; that is, they would not be detected by 
ALFALFA given their low H 1 content. If we focus instead on star-forming 
galaxies, we find significantly better agreement even at higher masses, 
and we reproduce the observed H 1 richness relation over nearly four orders 
of magnitude in stellar mass. We also include a separate H I richness re- 
lation for the satellites of the most massive cluster and compare this 
to H1 measurements of galaxies in the Virgo cluster”® where ram pres- 
sure removes gas from infalling galaxies (Fig. 3a). Such hydrodynamic 
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Figure 2 | Projected number density profile of satellite galaxies in 

galaxy clusters. Symbols show a simulation sample as a function of 
cluster-centric projected radius, considering 10 haloes more massive than 
Msoo,crit = 10'*’Mo and all their satellite galaxies brighter than r< —20.5 mag 
within 3 X Rsoo,crit along the line of sight from the cluster centre. Me is the 
projected galaxy number density, r, is the projected distance from the halo 
centre and </rsgocrit> is the mean radius, which encloses Msoo,crit- The symbol 
colour reflects the average (g — r) colour for galaxies at that distance (see key). 
A reddening towards the cluster centre is clearly visible. Observational data‘ 
are shown for comparison (green line), while the dashed line gives the subhalo 
profile of the corresponding dark-matter (DM)-only simulation (normalized 
to match the satellite profile at large radii). The left blue region marks the area 
of incompleteness in the observed number density profiles due to obscuration 
from the brightest cluster galaxy (BCG). The green shaded region marks the 
observational uncertainties: outside the region of contamination of the BCG 
the scatter within the observational stack dominates, whereas Poisson errors 
are shown within that region. 


processes are not directly accessible to semi-analytic models but are cap- 
tured in our simulation. We can therefore recover the observed trend: 
satellites in cluster environments have a lower HI content compared to 
the whole population. The predicted difference is not as pronounced as 
in the observations, but this may well be caused by the fact that our volume 
does not contain a galaxy cluster as massive as Virgo. 

Every dynamical time, a small fraction of the galactic cold neutral 
gas turns into stars, which inherit the chemical composition of the gas. 
Stellar metallicities therefore probe different processes intrinsic to gal- 
axy formation: chemical enrichment, feedback, and in- and outflows. 
Hydrodynamic cosmological simulations have so far been unable to 
produce correct stellar metallicities; measures such as the mean stellar 
metallicity or the cosmic density of total metal mass locked up in stars, 
Pz,*, were discrepant from the observed values by almost a factor of 
two!?, Our model, on the other hand, produces pz+ = 7.75 X 10°Mo 
Mpc *, in agreement with the observed value” of pz= (7.1 + 2) X 
10°Mo Mpc ®. 

Recent observations find a relation where more massive galaxies con- 
tain a larger proportion of metals****. Our predictions agree well with 
these observations, and, most importantly, the simulation recovers the 
flattening of the relation above Mx ~ 10'°Me (Fig. 3b). The observed 
trend is reproduced over nearly five orders of magnitude in stellar mass. 
These agreements with observations are driven by the combination of 
our accurate hydrodynamic scheme together with realistic effective feed- 
back models and a complete stellar evolution prescription, a combina- 
tion that was lacking in previous studies. 
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Figure 3 | Neutral hydrogen and metal content of galaxies as a function of 
their stellar mass. a, Predicted H1 richness of galaxies (My, is the mass in 
neutral hydrogen) compared to the «.40 sample of the ALFALFA survey. Filled 
regions mark bins with more than 20 galaxies (red, ALFALFA; green, all 
galaxies; blue, star-forming galaxies). Error bars indicate s.e.m. for individual 
galaxies. A separate H1 richness relation for the satellites of the most massive 
cluster is included and compared to HI measurements of Virgo satellites”? to 
demonstrate the strong effect of environment on the H1 content of satellites. 
b, Stellar metal content of simulated galaxies, Z, in units of solar metallicity, 
Zo (blackline and grey shaded squares), compared to observations ”° (green). 
Reassuringly, the trend of the median relation including the flattening above the 
stellar mass Ms ~ 10''Mo is well recovered. The green shaded region 
represents the s.d. The error bars represent the s.e.m. for individual galaxies. 


Large-scale characteristics of neutral hydrogen 

The space between galaxies, the intergalactic medium, is filled with low- 
density, warm gas, mainly composed of ionized hydrogen. Observa- 
tionally, the intergalactic medium can be probed through the forest of 
Lyman-« absorption lines in quasar spectra. The corresponding dis- 
tribution of hydrogen column densities is well-constrained*””* over 
a wide dynamic range. At high gas densities, hydrogen becomes self- 
shielded from the ionizing ultraviolet background and forms dense 
neutral clouds with a distinctive spectral absorption signature, the so- 
called damped Lyman-z absorbers (DLAs), which are observed mostly 
at redshift z~ 2-5 and probe initial stages of galaxy formation. 
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Early hydrodynamic simulations successfully predicted the low- 
column-density statistics of the Lyman-« forest”’, whereas describing 
the main properties of higher-density absorbers and DLAs correctly only 
became feasible over the past few years*”*’. However, recent numerical 
calculations have failed to explain the metallicity distribution of DLAs, 
which is now reasonably well constrained by data’. The outcomes 
of most simulations gave metallicities for DLAs that are too high or 
yielded a metallicity distribution that is too broad, in disagreement with 
observations****, 

We contrast our predicted H 1 column density distribution function 
(CDDF; Fig. 4a) and the metal content of DLAs (Fig. 4b) with observations. 
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Figure 4 | Large-scale characteristics of neutral hydrogen. a, Column density 
distribution function (CDDF), f(Niy), of neutral hydrogen at z = 3 compared to 
observations”**”. The dashed vertical line shows the density threshold 
separating damped Lyman-o systems (DLAs) and Lyman limit systems. The 
shaded region shows an estimate for the CDDF constrained by the assumption 
of a power law fit and the observed incidence of Lyman limit systems. The 
vertical error bars represent the s.e.m., and the horizontal ones represent the 
binning. b, Probability density function of the DLA metallicity in units of solar 
metallicity at z= 3 compared to observational findings’. The observational 
vertical error bars show the s.e.m. derived from the number of observed spectra 
in each metallicity bin. Bin widths have been chosen to be larger than the 
maximal uncertainty in each individual metallicity measurement (horizontal 
error bars). 
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For the DLA metallicities, we compare to an observational compendium”, 
based on all available quasars between z = 2 and z = 4, whereas the CDDF 
data are centred at z = 3. The properties of DLAs are sensitive to the 
balance between gas accretion, outflows and ionization. Probing such 
a complex interplay of processes is a particular strength of hydrodyn- 
amic simulations. The prediction of our model is in remarkable agree- 
ment with the observational data, reproducing in detail the shape and 
location of the metallicity peak. This success is a result of accurate hy- 
drodynamics and modelling of galactic super-winds resulting in metal 
outflows. Also, we find good agreement between our theoretical pre- 
dictions and the observed CDDF. 

At low redshift, most of the baryons have not yet been detected. Im- 
proved instrumental sensitivity may lead to their detection in the fore- 
seeable future*, at a level depending on exactly where they reside, and 
at what temperature, which is a matter of theoretical debate. On the 
basis of our model we predict that, at z = 0, gas that is not bound to 
any haloes constitutes 81% of the total baryons in the Universe, but 
contains only 34% of the heavy metals. 


The impact of baryons on dark matter 
The discussion thus far has stressed that a direct modelling of baryonic 
physics is essential to soundly connect cosmological predictions to gal- 
axy observations. Even beyond that, baryonic processes can actually 
affect and modify the dark-matter distribution, and therefore alter the 
matter power spectrum P(k). Measuring P(k) as a function of wave- 
number, k, provides a powerful cosmological probe, because theoret- 
ical models can be used to connect the present-day P(k) to the initial 
power spectrum. Such measurements come, for example, from weak 
lensing studies, galaxy clustering surveys and the analysis of the Lyman- 
a. forest. However, the interpretation of upcoming weak lensing surveys 
such as EUCLID, which will measure P(k) on scales of 0.1h Mpc! <k 
< 10h Mpc ', must consider the impact of baryons in order to achieve 
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Figure 5 | Nonlinear matter power spectrum. a, The dimensionless total 
matter power spectrum, A’(k), of the Illustris simulation (black line) differs 
significantly, owing to baryonic effects, from that of the dark-matter-only 
counterpart Illustris-Dark (light blue). Analytic fitting models*”° (green and 
pink) do not provide an adequate description of the hydrodynamic results. The 
theoretical shot noise level (shown as thin dashed lines) has been subtracted in 
the measurements. b, The relative differences between Illustris and Illustris- 
Dark highlight the fact that baryonic effects already exceed 1% on scales smaller 
than k= 1h Mpc". Dashed black lines mark 1%, 10% and 100% relative 
differences. 
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the required level of ~1% accuracy**’”. This effect has often been ne- 
glected, even though it is not restricted to just the smallest spatial scales. 
We have calculated the dimensionless matter power spectrum, A(k) 
= kP(k)/(20°), along with the corresponding dark-matter-only result 
(Fig. 5). The dynamic range of our simulation allows us to probe P(k) 
on a wider range of scales than possible before through a single hy- 
drodynamic simulation. On scales smaller than k ~ 1h Mpc", AGN- 
driven outflows reduce the total power in a scale-dependent way by 
up to ~30%-40% compared to the dark-matter-only prediction. This 
impact is even larger than found in previous studies*, which is most 
probably related to the strong radio-mode AGN feedback in our simu- 
lation needed to match the stellar mass content of massive haloes. On 
smaller scales, gas cooling processes become important and enhance 
the power on scales smaller than k ~ 100h Mpc_'. Here, the power spec- 
trum can deviate by a factor of a few compared to collisionless results. 
Measuring this fundamental statistic precisely hence requires high- 
resolution hydrodynamic simulations in large volumes; it cannot be 
done by dark-matter-only simulations or semi-analytic modelling. We 
also present in Fig. 5 the predictions of commonly employed empirical 
fitting models for the nonlinear evolution**” of P(k). Those models have 
been calibrated on dark-matter-only simulations and clearly fail to des- 
cribe the results of full hydrodynamic calculations, rendering their use 
impractical for the precision requirements of upcoming surveys. 


Looking ahead 


Although our simulation provides a significant step forward in mod- 
elling galaxy formation by reproducing simultaneously many dispar- 
ate observations on large and small scales, there are still outstanding 
problems. One such problem lies in the formation of low-mass galaxies: 
our simulation tends to build up the stellar mass of low-mass galaxies 
below Mx ~ 10'°M 5 too early, resulting in stellar populations that are 
too old, with mean ages a factor of two to three larger than observed. 
This tension is shared by semi-analytic models and other recent hydro- 
dynamic simulations alike, pointing towards an open problem in low- 
mass galaxy formation*’. It remains to be seen whether new stellar 
feedback models that, for example, include effects of radiation pres- 
sure on dust can resolve this issue”. It will clearly be challenging to test 
new schemes that also directly treat the stellar radiation fields with sta- 
tistically meaningful samples of galaxies, as this poses extremely high 
computational demands that go significantly beyond what was achieved 
in the present work. Nevertheless, such new generations of large-scale 
high-resolution hydrodynamic simulations might become feasible with- 
in the next decade. 


METHODS SUMMARY 


The equations of gravity and hydrodynamics are evolved using the moving-mesh 
code AREPO® combined with a galaxy formation model which includes®: gas cool- 
ing with radiative self-shielding corrections; star formation; energetic feedback from 
growing SMBHs and exploding supernovae; stellar evolution with associated chem- 
ical enrichment and stellar mass loss; and radiation proximity effects for AGNs. 
Our AGN feedback consists of three components: thermal quasar-mode feedback; 
thermal-mechanical radio-mode feedback; and radiative feedback. The efficiency 
of physical processes below the resolution scale are treated in parameterized form 
and have been calibrated to reproduce the observed global efficiency of star forma- 
tion. Our simulation assumes a ACDM cosmology with the parameters Q,,, = 0.2726, 
Qy = 0.7274, Q, = 0.0456, og = 0.809, n, = 0.963 and Hy = 100hkms | Mpc ', 
with h = 0.704. Here Qn, Qa, and Q are respectively the mass/energy density 
contributions from matter, the cosmological constant and baryons, h encodes the 
Hubble expansion rate at redshift zero, n, is the spectral index of the primordial 
power spectrum and gg is the root mean squared amplitude of mass fluctuations 
in 8h_' Mpc spheres linearly extrapolated to redshift zero. Initial conditions are 
generated at z = 127 ina periodic box witha sidelength of 75h ' Mpc ~ 106.5 Mpc. 
The initial gas temperature at z = 127 is set to 245 K. We have also performed a 
second simulation, Illustris-Dark, which does not include baryons and the related 
feedback, but is otherwise identical to Illustris. Dark-matter haloes were identified 
in an on-the-fly manner during the simulation for each snapshot using a friends- 
of-friends (FOF) algorithm with a linking length of 0.2 times the mean particle 
separation. The minimum dark-matter particle number was set to 32 for the FOF 


ARTICLE 


identification. Non-dark-matter particles are attached to these FOF primaries in a 
secondary linking stage“. Subsequently, gravitationally bound substructures are 
identified using the SUBFIND algorithm**’. 16 million CPU hours were needed 
to evolve the simulation from the starting redshift z = 127 to z = 0, using 8,192 
cores and an equal number of MPI-ranks. An additional 3 million CPU hours were 
spent on carrying out the on-the-fly galaxy identification with the SUBFIND algorithm. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


Simulation code. Our simulation code, AREPO, uses an unstructured Voronoi 
tessellation of the simulation volume, where the mesh-generating points of this 
tessellation are moved with the gas flow. The adaptive mesh is used to solve the 
equations of ideal hydrodynamics with a finite volume approach using a second- 
order unsplit Godunov scheme with an exact Riemann solver. This approach is 
under most circumstances superior to traditional smoothed particle hydrodynamics 
(SPH), and also to Eulerian adaptive mesh refinement (AMR)***'. This scheme 
naturally produces extended disk galaxies without invoking extreme forms of stellar 
feedback or star formation”, which was a major problem of previous galaxy forma- 
tion simulations. The gravity calculation employs a Tree-PM scheme’, where long- 
range forces are determined with a particle-mesh method (PM) while short-range 
forces are computed via a hierarchical tree algorithm”. 

Galaxy formation physics. Our simulation accounts for a variety of astrophysical 
processes known to be relevant for galaxy formation”. 

Gas cooling rates are calculated as a function of gas density, temperature, metal- 
licity, the radiation fields of active galactic nuclei and the spatially uniform but 
time-dependent ionizing background radiation from galaxies and quasars*’, which 
completes H1 reionization at a redshift of z ~ 6. We use a self-consistent calcula- 
tion of the primordial cooling and complement it with the cooling contribution of 
the metals, based on pre-calculated cooling rate tables using CLOUDY™. All cool- 
ing rates include self-shielding corrections”. 

We employ a sub-resolution model of the interstellar medium to achieve nu- 
merical closure below our resolution scale. Gas with hydrogen number density above 
0.13 cm” * follows an effective equation of state with a stochastic prescription for 
star formation following the Kennicutt-Schmidt law” and adopting a Chabrier 
initial mass function’. The effective equation of state assumes that the interstellar 
medium has a two-phase structure that is predominantly composed of cold clouds 
embedded in a tenuous, supernova-heated phase™. 

Once stellar populations are born, they can lose mass, for example through stellar 
winds or supernovae. This mass is returned to the gas phase and enriches the gas 
surrounding stellar populations. We track the evolution of stars and model super- 
novae of type Ia, type II and the asymptotic giant branch phases of stars. We trace 
the evolution of nine elements in total (H, He, C, N, O, Ne, Mg, Si, Fe), each ad- 
vected as a passive scalar. 

Stellar feedback is realized through a kinetic wind scheme with a velocity scaling 
based on the local one-dimensional dark-matter velocity dispersion (3.70pm,1p)> 
anda mass loading inferred from energy conservation assuming 1.09 X 10°! erg per 
type II supernova. We use a sub-grid metal-loading scheme that regulates the degree 
of wind enrichment such that 40% of the local interstellar medium metals are ejected 
by supernova-driven galactic winds. This is required to simultaneously reproduce 
the stellar mass content of low-mass haloes and their gas oxygen abundances”. 

We include procedures for supermassive black hole (SMBH) seeding, accretion 
and merging”. Feedback from SMBHs operates in either a quasar-mode or a radio- 
mode, depending on their accretion rate®. In addition, a prescription for radiative 
SMBH feedback is included that modifies the ionization state and hence the net 
cooling rate of nearby gas. SMBHs are seeded in friends-of-friends groups more 
massive than 7.1 X 10'°M. with a seed mass of 1.4 X 10°Mo. 

The free parameters of our model are set to physically plausible values and have 
been adjusted within the allowed range to roughly reproduce the relation between 
mean stellar mass and halo mass inferred from abundance matching analysis. The 
resulting parameter settings have been tested on smaller-scale simulations’ and 
high-resolution zoom-in simulations of individual Milky Way-like haloes™. 
Initial conditions. To create initial conditions we use the Boltzmann code 
CAMB*** to compute the linear power spectrum of a ACDM cosmology with 
the parameters Q,, = 0.2726, 2, = 0.7274, Q, = 0.0456, og = 0.809, n, = 0.963 
and Hy = 100hkms~' Mpc ' with h = 0.704. These parameters are consistent with 
the latest Wilkinson Microwave Anisotropy Probe (WMAP)-9 measurements”, 
but slightly offset from the first year results of the Planck mission’. However, a recent 
re-analysis of the Planck data found parameters more consistent with pre-Planck 
cosmic microwave background analyses and astronomical observations®. 

We create a random realization of this cosmology in periodic boxes with a side 
length of 75h”! Mpc ~ 106.5 Mpc, starting from an initial ‘glass-like’ particle con- 
figuration” composed of one thousand 182° particle tiles. We employ a 3,640? fast 
Fourier transform to calculate the displacement field and use Lagrangian perturba- 
tion theory (Zel'dovich approximation”) to displace particles, and we de-convolve 
the input power spectrum for smoothing effects due to the interpolation off this 
grid. Initial conditions are generated at z = 127 with mesh-generating points added 
to the initial conditions by splitting each original particle into a dark matter and gas 
cell pair, displacing them with respect to one another such that two interleaved 
grids are formed, keeping the centre-of-mass of each pair fixed. The initial gas tem- 
perature at z = 127 is set to 245 K based on a RECFAST”” calculation. We have 
generated 100 different random fields and inspected their power spectra and mass 
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functions at z = 0 to make sure that we do not simulate an unusual or extreme den- 
sity field that is dominated by a few large clusters or voids due to cosmic variance. 
Simulation details. The simulation volume contains initially 6,028,568,000 hydro- 
dynamic cells and the same number of dark-matter particles resulting in a dark- 
matter mass resolution of 6.26 X 10°M, anda baryonic mass resolution of 1.26 X 
10°Mo. The gravitational softening length of dark-matter particles is fixed in comov- 
ing coordinates (py = 1h kpc). For baryonic particles (stars and SMBHs), we 
limit the softening length to a maximum physical scale (éparyon = 0.5h i kpc). Gas 
cells use an adaptive softening length tied to their cell radius with a floor given by 
the softening length of the collisionless baryonic particles. We employ a (de-) re- 
finement scheme which keeps the cell masses typically within a factor of two of a 
specified target mass set to 1.26 X 10°Mo, anda regularization scheme steering the 
mesh towards a computationally efficient centroidal configuration’**“*. The smal- 
lest cells in Illustris have a typical extent of 48 pc. For the least massive cells we 
achieve a mass resolution of 1.5 X 10*Mo. 

Galaxy identification. Dark-matter haloes were identified in an on-the-fly man- 
ner during the simulation for each snapshot using a friends-of-friends (FOF) 
algorithm” with a linking length of 0.2 times the mean particle separation. The 
minimum dark-matter particle number was set to 32 for the FOF identification. 
Non-dark-matter particles are attached to these FOF primaries in a secondary link- 
ing stage“*. Subsequently, gravitationally bound substructures are identified using 
the SUBFIND algorithm****. We derive the various galaxy properties from the grav- 
itationally bound mass that is contained within a radius r« that equals twice the 
stellar half-mass radius of each SUBFIND (sub)halo. Using this definition, the ga- 
lactic stellar mass does not significantly differ from the total stellar mass for low- 
mass systems, but some of the intra-cluster light for massive systems is excluded. 
We have checked this definition against surface brightness cuts in different bands 
and find it to give similar results to more elaborate methods for excluding intra- 
cluster light. We have extended this radius by 50% for measuring the galactic H1 
masses. Stellar population synthesis models” were used to associate the stars in our 
simulation with observable broad band luminosities. Producing stellar images of 
galaxies requires assigning colour values to specific bands. Specifically, we make an 
RGB mapping of the (g,r,i) bands using a commonly employed asinh scaling”®. 
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FXR is a molecular target for the effects of 
vertical sleeve gastrectomy 


Karen K. Ryan’, Valentina Tremaroli*, Christoffer Clemmensen’”, Petia Kovatcheva-Datchary’, Andriy Myronovych’, 
Rebekah Karns’, Hilary E. Wilson-Pérez!, Darleen A. Sandoval’, Rohit Kohli*, Fredrik Baickhed® & Randy J. Seeley! 


Bariatric surgical procedures, such as vertical sleeve gastrectomy (VSG), are at present the most effective therapy for the 
treatment of obesity, and are associated with considerable improvements in co-morbidities, including type-2 diabetes 
mellitus. The underlying molecular mechanisms contributing to these benefits remain largely undetermined, despite 
offering the potential to reveal new targets for therapeutic intervention. Substantial changes in circulating total bile acids 
are known to occur after VSG. Moreover, bile acids are known to regulate metabolism by binding to the nuclear receptor 
FXR (farsenoid-X receptor, also known as NR1H4). We therefore examined the results of VSG surgery applied to mice 
with diet-induced obesity and targeted genetic disruption of FXR. Here we demonstrate that the therapeutic value of 
VSG does not result from mechanical restriction imposed by a smaller stomach. Rather, VSG is associated with increased 
circulating bile acids, and associated changes to gut microbial communities. Moreover, in the absence of FXR, the ability 
of VSG to reduce body weight and improve glucose tolerance is substantially reduced. These results point to bile acids and 
FXR signalling as an important molecular underpinning for the beneficial effects of this weight-loss surgery. 


Perhaps surprisingly, the most effective and durable therapies for the 
treatment of obesity involve surgical, rather than pharmacological or 
behavioural, intervention. In contrast to the modest impact of diet 
and exercise, most individuals that lose weight with bariatric surgery 
maintain reduced levels of body fat for many years'. This is associated 
with considerable improvements in co-morbidities. Notably, 40% of 
obese type-2 diabetes mellitus patients that undergo bariatric surgery 
achieve full remission within one year’. Despite its efficacy, however, 
surgery is not an attractive therapeutic option for many individuals 
suffering from obesity and its consequences. Therefore it is both scien- 
tifically and clinically imperative that we identify molecular mechan- 
isms responsible for weight loss and other metabolic improvements so 
we can target affected pathways in a less-invasive manner. 

In the bariatric procedure VSG, approximately 80% of the stomach 
is removed along the greater curvature. This creates a gastric ‘sleeve’ in 
continuity with the oesophagus and duodenum. VSG induces loss of 
body weight and fat mass, and improves glucose tolerance in humans 
and in rodents**. Importantly, the efficacy of VSG is comparable to 
the more complex Roux-en-Y gastric bypass (RYGB), which not only 
creates a smaller stomach but also involves surgical re-routeing of the 
small intestine**. A recent randomized controlled trial found that VSG 
and RYGB produced nearly the same remission of type-2 diabetes mel- 
litus after one year, and that both procedures were significantly superior 
to medical management’. 

Conventional wisdom holds that bariatric surgeries, including VSG 
and RYGB, lead to weight loss and resolution of type-2 diabetes melli- 
tus directly as a result of reducing the ability of the stomach to physi- 
cally accommodate a meal. That is, the stomach is made smaller, perhaps 
imposing a mechanical restriction on the amount of food that can be 
consumed at one time. Moreover, in RYGB, the flow of nutrients and 
biliopancreatic secretions is re-routed, perhaps limiting macronutrient 
absorption. However, growing evidence indicates that restriction and 


malabsorption are not the primary mechanisms driving metabolic improve- 
ments after bariatric surgery®. As just one example, a substantial pro- 
portion of diabetic patients are able to stop taking their medications 
within days of surgery, before substantial weight loss has occurred’. 
Given the short time frame, some controversy exists about the mechan- 
isms underlying this dramatic outcome. Some hypothesize that acute 
improvements are independent of weight loss and result from changes 
in gut physiology, including altered neurohumoral signalling or altered 
microbial ecology. In contrast, others suggest these benefits are the result 
of the hypocaloric post-operative milieu and rapid weight loss”*. 
Among the changes to gut physiology that occur following bariatric 
surgery is altered enterohepatic circulation of bile acids. Both RYGB and 
VSG are associated with a significant increase in circulating total bile 
acids in humans and in rodent models®”®. It is now clear that, in addi- 
tion to aiding the mechanical digestion and absorption of lipids, bile 
acids also bind to FXR to function as signalling molecules contributing 
to the regulation of various metabolic processes. In light of this role, we 
hypothesized that FXR-signalling links altered bile acid homeostasis to 
important post-operative changes in metabolism and gut microbial 
communities, thereby contributing to the maintenance of weight loss 
and improvements in glucose control observed following VSG. To test 
this hypothesis, we applied VSG surgery to diet-induced obese mice with 
targeted genetic disruption of FXR and their wild-type littermates. 


Unbiased pathway analysis 


We used sequencing of messenger RNA (mRNA-seq), together with sub- 
sequent unbiased pathway analysis, to identify key biological pathways 
that were altered in the distal small intestine following VSG (Fig. 1). Con- 
sistent with our recent report that VSG significantly alters the expression 
of hepatic genes involved in lipid and bile acid metabolism’, the path- 
way ‘Nuclear receptors in lipid metabolism and toxicity’ emerged as 
one of the top pathways enriched in genes differentially regulated (fold 
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Figure 1 | Unbiased pathway analysis in VSG ileum. a, Based on a pathway 
analysis of genes differentially regulated (fold change =1.5) in the terminal 
ileum of VSG mice relative to sham-operated, pair-fed (SPF) controls, 

the following top five pathways were significantly enriched as follows: 
glutathione mediated detoxification (P = 6.00 X 10°), nicotine degradation 
(P=5.64X 10 *), metapathway biotransformation (P = 4.81 X 10°), 
nuclear receptors in lipid metabolism and toxicity (P = 7.18 X 10°), and type 
II interferon signalling (P = 1.25 X 10 *). N, number of genes differentially 
regulated in each of the five pathways; n = 5 mice per group. b, Genes enriched 
in the five pathways identified in a. 


change =1.5) in VSG (P = 7.18 X 10° °). Intriguingly, among the other 
top pathways, several point towards an altered gut microbiota. Specifically, 
the glutathione pathway (P = 6.00 X 10°), which plays important roles 
in nutrient metabolism and antioxidant defence, is known to be sig- 
nificantly altered by the presence/absence of a microbiota—and this is 
associated with changes in bile acid composition’""”. Likewise, many of 
the biotransformations (P = 4.81 X 10° °) that are increased probably 
reflect an altered microbiota, induced for detoxification of metabolites 
such as secondary bile acids. Finally, changes in gut microbial com- 
munities have been extensively linked to altered host immune response 
including interferon-signalling'’*"* (P = 1.25 X 10 *). 


FXR, VSG and body composition 


Because we and others have consistently observed increased circulat- 
ing total bile acids”’°’*'* and increased FXR signalling’’” following 
various bariatric procedures including VSG, and because Fxr was one 
of the regulated genes identified in our pathway analysis, we hypothe- 
sized that FXR-signalling contributes to the metabolic benefits of VSG. 
To test this, we first generated whole-body FXR knockout mice (KO), 
and their wild-type littermate controls (WT; Methods). Because KO mice 
are somewhat resistant to diet-induced obesity’*, we maintained these 
mice on a 60% high-fat diet (HFD) for 10 weeks before surgery, such 
that both WT and KO mice became obese. Both genotypes carried more 
than 30% of their body weight as fat before surgery (Fig. 2a, b); our 
previous data indicate that this level of adiposity provides more-than- 
sufficient opportunity for VSG-induced body weight and fat loss*"”. 

VSG or sham surgery was performed in these obese male WT and 
KO mice. In the first week after surgery, WT and KO mice lose a com- 
parable amount of weight relative to their sham-operated controls. 
However, whereas WT-VSG mice maintained this relative weight loss 
for the duration of the experiment (Fig. 2c, repeated measures analysis 
of variance (RM ANOVA) with Tukey post-hoc tests, P< 0.001), the KO- 
VSG animals recovered it. Within 5 weeks, the body weight of KO-VSG 
animals was no longer different from that of sham-operated controls 
(Fig. 2d, RM ANOVA with Tukey post hoc, P < 0.001). Moreover, when 
we measured body composition at 11 weeks after the operation, WT- 
VSG mice had half the body fat of sham-operated WT controls, while 
the body fat of KO-VSG and KO-sham operated mice was equivalent 
(Fig. 2e, f, two-way ANOVA with Tukey post hoc, P< 0.01). 

We note that KO-sham mice continue to gain weight at a slower rate 
than WT-sham mice following surgery; this needs to be taken into con- 
sideration. However, the failure of FXR-VSG mice to sustain body weight 
and body fat loss is unlikely to be a product of KO mice being relatively 
resistant to diet-induced obesity, for two reasons. First, we have observed 
potent reductions in body fat after VSG in mice that have as little as an 
average 7 g of body fat*”, whereas the FKR-KO mice in this study had 
an average of 9 g. Second, we executed post-hoc analyses, selecting the 
heaviest KO and the lightest WT mice to create a subset with compar- 
able body weights and composition before the surgery. Importantly, 
in this subset WT-VSG mice maintain an 11.3 g decrement in body 
weight and a 10.2 g decrement in body fat compared to WT-sham, 
whereas KO-VSG mice recover to match the body weight and body 
fat of KO-sham controls within 4-5 weeks. Further, KO-VSG mice are 


Figure 2 | FXR contributes to the maintenance 
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significantly heavier than WT-VSG mice by 8 weeks after surgery (Ex- 
tended Data Fig. la—c, two-way ANOVA with Tukey post hoc, P < 0.001) 
and maintain a greater fat mass than WT-VSG mice at 11 weeks after 
surgery (Extended Data Fig. 1d, e, two-way ANOVA with Tukey post 
hoc, P< 0.05). 


FXR, VSG and feeding behaviour 


To understand how KO-VSG animals were able to recover this body 
weight, we measured weekly HFD intake in these singly housed mice. 
Both WT and KO mice consumed fewer calories during the first week 
following VSG as compared to sham surgery. However, whereas WT- 
VSG mice maintained this lower food intake for up to 3 weeks (Fig. 3a, 
RM ANOVA with Tukey post hoc, P < 0.001), KO-VSG mice were only 
hypophagic during the first week (Fig. 3c, RM ANOVA with Tukey post 
hoc, P< 0.001). Importantly, and in contrast to what occurs when ani- 
mals are calorically-restricted in the absence of surgery”, this caloric 
deficit was not recovered at later time points in WT-VSG mice, allow- 
ing for the maintenance of reduced body weight and body fat over the 
course of the study (Fig. 3a, b). On the other hand, beginning in post- 
operation week 4, KO-VSG mice actually consumed more food than 
sham operated controls (Fig. 3c, d, Tukey post hoc P < 0.05), suggesting 
that FXR signalling is necessary for the repression of rebound hyper- 
phagia following caloric restriction initially achieved by VSG. This result 
is also consistent with our growing understanding that VSG is a meta- 
bolic, rather than purely restrictive, procedure®, as both genotypes are 
left with a similarly small stomach after VSG. 

Eight weeks after surgery, WT- VSG mice had consumed 15% fewer 
calories than WT-sham mice, but cumulative food intake by KO-VSG 
mice was equivalent to sham-operated controls (Fig. 3e, two-way ANOVA 
with Tukey post hoc, P< 0.01). Further, when we allowed the mice to 
choose among three pure macronutrient diets, WT- VSG mice had a sig- 
nificantly reduced preference for fat compared to WT-sham. Unexpect- 
edly, KO-sham mice already exhibit a substantial reduction in the prefer- 
ence for fat compared to WT-sham. This was not further reduced by VSG 
(Fig. 3f, g, two-way ANOVA with Tukey post hoc, P< 0.05). However, 
we cannot exclude the possibility that the lack of an effect in the KO mice 
represents a physiological ‘floor’ that cannot be further reduced by VSG. 


FXR, VSG and glucose tolerance 

To investigate whether FXR-signalling contributes to improvements 
in glucose tolerance observed following VSG, we challenged fasted mice 
with an intraperitoneal (i.p.) injection of dextrose at a dose of 1 g per 
kg-body weight. VSG was associated with a 20% decrease in fasting 
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blood glucose in WT mice, whereas it was associated with a 24% increase 
in fasting blood glucose in KO mice (Fig. 4a, two-way ANOVA with Tukey 
post hoc, P< 0.001). WT-VSG mice exhibited a substantial improve- 
ment in the ability to clear the ip. injection of glucose, reflected as a 35% 
reduction in the area under the curve (AUC) relative to sham-operated 
controls. In contrast, KO-VSG and sham-operated mice exhibited no 
differences in glucose tolerance (Fig. 4b, c, two-way ANOVA with Tukey 
post hoc, P< 0.01; t-test, P< 0.001). Although the present findings do 
not address the question of whether the improvements observed in 
WT-VSG relative to WT-sham mice are independent of weight loss, we 
note that KO-VSG and WT-VSG mice had equivalent body weights at 
the time of this glucose tolerance test. Despite this, when the glucose 
excursions of WT-VSG and KO-VSG mice are compared directly, KO- 
VSG mice exhibit significantly impaired glucose clearance at both 30 
and 60 min (two-way RM ANOVA with Tukey post hoc, P < 0.05, Ex- 
tended Data Fig. 2). 


FXR, VSG and gut microbial communities 


Gut microbial communities are altered in obese individuals” and respond 
to changes in caloric intake and macronutrient content of the diet”. 
Recent evidence implicates changes in gut microbial communities, both 
in humans and mice, as potential contributors to the benefits of weight- 
loss surgery’**°. Transplantation of the gut microbiota from obese mice 
into germ-free mice is associated with weight gain*””*, whereas trans- 
plantation of gut microbiota from RYGB mice is associated with reduced 
weight gain compared to transplantation with sham microbiota”. 

Because bariatric surgery affects the prevalence of many microbial 
species, it has been difficult to ascertain which specific changes might 
functionally contribute to its benefits. The present study provides an 
opportunity to address this question, by comparing the interacting effects 
of FXR deficiency and VSG on gut microbial ecology. Specifically, because 
FXR KO mice are refractory to the metabolic benefits of VSG, we infer 
that any changes in gut microbial communities that occur after VSG irre- 
spective of genotype are not sufficient (but may nonetheless be necessary) 
to recapitulate its benefits. On the other hand, changes that result from an 
interaction between genotype and surgery are more likely to be critical to 
the potent effect of VSG. Pertinent to this, bile acids can modulate the 
abundance and composition of gut microbes, both directly, and indirectly 
by activation of FXR-signalling””°. Conversely, the gut microbiota can 
modify bile acid composition by microbial enzymatic activities, thereby 
altering both bile acid homeostasis and FXR signalling*’”. 

Mice were killed 14 weeks following surgery, and we extracted geno- 
mic DNA from caecal contents and performed pyrosequencing of the 


Figure 3 | FXR alters feeding behaviour after 
o VSG. a, b, WT mice eat significantly less during 
the first 3 weeks after VSG versus sham surgery; 
this is not recovered at later time points. c, d, KO 
mice eat significantly less during the first week after 
VSG versus sham surgery; beginning at week 4, 
KO-VSG mice compensate for this initial caloric 
deficit by consuming more calories than sham- 
operated controls. e, By 8 weeks after surgery, 
cumulative food intake by WT-VSG mice was 15% 
less than WT-sham mice, whereas food intake by 
KO-VSG and sham-operated mice was equivalent. 
f, When given the choice among three pure 
macronutrient diets, WT-VSG mice exhibit a 
blunted preference for dietary fat relative to 
carbohydrates (Carb.) and protein (Pro.). 
g, Among WT mice, VSG reduces the preference 
for dietary fat. Moreover, loss of FXR is associated 
with lack of preference for dietary fat that is not 
further altered by the surgery. Data are shown as 
mean + s.e. *P < 0.05, **P < 0.01, ***P < 0.001. 
For panels a-e, n = 13 WT-sham, 10 WT-VSG, 
9 KO-sham, 10 KO-VSG. For panels f, g, n = 12 
WT-sham, 6 WT-VSG, 8 KO-sham, 7 KO-VSG. 


-8- KO-VSG 


3 4 5 6 7 8 


Sham &3 VSG 


a oo 
oO Oo 


consumed as fat 
a 
oO 


8 MAY 2014 | VOL 509 | NATURE | 185 


©2014 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


-e WT-sham_ -@- WT-VSG c 
-m- KO-sham_ -&- KO-VSG 


[1 Sham EH VSG 


50,000 ig 
40,000 
S 30,000 


=? 
20,000 


Tr = 
2 ‘s 
£ E 
B oO 
[e) ao 
(S) fo} 
f= | [e) 
3 =| 
Oo [>)) 
[e) me} 
a 8 
m a 
4,000 : 
0 30 


Figure 4 | FXR contributes to the improvement in glucose tolerance 

observed following VSG. a, Among WT mice, fasting blood glucose is reduced 
following VSG, whereas among KO mice it is increased. b,c. Among WT mice, 
VSG improves the excursion of blood glucose following an i.p. bolus of 1 gkg”* 


16S ribosomal RNA gene. Because all the mice in this study were main- 
tained on the same purified HFD, and because the average caloric intake 
by WT-VSG mice was only reduced in the first 3 weeks of this 14-week 
study (Fig. 3), any observed differences are unlikely to be secondary to 
differing caloric intake or macronutrient content of the diet. Moreover, 
because WT-VSG, KO-sham and KO-VSG mice had equivalent body 
weights at death, any observed differences among these groups are unlikely 
to be secondary to differing body weight or composition. 

Weighted UniFrac analysis (sensitive to abundances of taxa)*’ showed 
separation of WT-VSG samples from WT-sham samples along the first 
principal component, indicating that VSG had a stronger effect on micro- 
biota composition in WT mice than in KO mice (Fig. 5a, Extended Data 
Fig. 3, two-way ANOVA with Tukey post hoc). As shown in Fig. 5b, c, the 
separation of samples in the principal components analysis plot reflects 
differences in the Bacteroidetes and Firmicutes levels. In light of these 
differences, we further identified specific taxonomic changes that result 
from an interaction between genotype and surgery, as these may rep- 
resent microbial groups that are critical to the potent effects of bariat- 
ric surgery—though future studies involving colonization of germ-free 
mice would be required for definitive evidence supporting a functional 
involvement. 

The relative abundance of several bacteria previously identified as 
important modulators of systemic metabolism was altered differently 
by VSG according to genotype. Specifically, the relative abundance of 
Bacteroides was substantially reduced in WT-VSG mice relative to WT- 
sham controls, but did not vary with surgery among KO mice (Fig. 5d, 
two-way ANOVA with Tukey post hoc, P< 0.05). A recent study investi- 
gating the therapeutic potential of probiotic treatment in obese women 
found that decreasing prevalence of Bacteroides was associated with de- 
creasing adiposity and improved glucose control™. Likewise, in the present 
study, decreasing abundance of Bacteroides was associated with decreased 
weight gain, fat gain (Extended Data Fig. 4), and decreased AUC ina 
glucose tolerance test—suggesting a functional association that depends 
on FXR. 

An uncharacterized genus in the Porphyromonadaceae family, a 
bacterial group negatively correlated with type-1 diabetes**, and decreased 
in HFD-fed mice”’, was substantially increased in WT-VSG mice rela- 
tive to WT-sham controls. In contrast, this group was reduced in KO-VSG 
mice relative to their sham-operated controls (Fig. 5e, two-way ANOVA 
with Tukey post hoc, P< 0.05). 

Finally, the relative abundance of Roseburia was 12-fold higher in 
WT-VSG caeca compared to WT-sham, but the abundance of this 
microbe in both KO groups resembled the WT-sham group (Fig. 5f, 
two-way ANOVA with Tukey post hoc, P < 0.05). Importantly, and con- 
sistent with our findings, two recent gut metagenome analyses identified 
decreased abundance of Roseburia as highly discriminant for type-2 
diabetes mellitus, in both a European and a Chinese population**”’. 
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dextrose; among KO mice, the glucose tolerance of VSG and sham-operated 

mice is equivalent. AUC, area under the curve. Data are shown as mean + s.e. 
*P< 0.05, **P< 0.01, ***P < 0.001. n = 13 WT-sham, 10 WT-VSG, 9 KO- 

sham, 10 KO-VSG. 


Likewise, a study investigating the therapeutic effect of a prebiotic treat- 
ment in rodents found that increased abundance of Roseburia was 
associated with weight loss and improved glucose tolerance, indepen- 
dent of food intake**. In the present study, the abundance of Roseburia 
was negatively correlated with AUC in the glucose tolerance test 
(Extended Data Fig. 4), and this was due primarily to the relationship 
within the WT-VSG group (R? = 0.75, P< 0.05, Fig. 5g), suggesting an 
FXR-dependent functional association. 

In addition, we observed genotype-independent changes in the rela- 
tive abundance of the genera Lactobacillus and Lactococcus and of 
Enterobacteriaceae (Extended Data Fig. 5). Consistent with this, an 
increase in Escherichia coli and other enterobacteria has likewise been 
reported after RYGB**”*. Such changes may indeed contribute to the 
benefits of VSG and other bariatric procedures, but the present data 
suggest they are not sufficient to elicit improvements in energy balance 
and glucose homeostasis. 

Changes in gut microbiota may influence host metabolism in part 
due to the ability of the microbiota to attain colonic dietary conversion, 
resulting mostly in the production of fatty acid end products. Pertinent 
to this, a wide range of data now links the production of short-chain 
fatty acids (SCFAs) to various metabolic outcomes***””**. To this end, 
we measured caecal SCFAs and other organic acids produced by the 
gut microbiota. There were no differences in the total abundance of 
SCFAs among the four groups (Extended Data Table 1). The relative 
abundances of butyrate and propionate (Extended Data Fig. 6a, b), but 
not acetate (Extended Data Fig. 6c) were altered by VSG and this did 
not differ between genotypes (two-way ANOVA with Tukey post hoc, 
P<0.05). The resulting decrease in the acetate:butyrate ratio (Extended 
Data Fig. 6d, two-way ANOVA with Tukey post hoc, P < 0.001) possi- 
bly indicates that acetate is more efficiently converted to butyrate follow- 
ing VSG. Consistent with the increase in Lactobacillus and Lactococcus, 
we also observed an increase in lactate following VSG (Extended Data 
Table 1). Again, although such changes may indeed contribute to the 
benefits of VSG and other bariatric procedures, the present data suggest 
they are not sufficient to elicit metabolic improvements. 


Discussion 


Though a number of local physiological and environmental variables 
may be altered by VSG, here we have identified a critical role for altered 
bile acid signalling. Taken together, the present results demonstrate that 
a functional FXR pathway is necessary for sustained weight loss, sup- 
pression of rebound hyperphagia and improved glucose control follow- 
ing VSG. Importantly, KO mice have an equivalently small stomach, 
yet they do not demonstrate the same improvements as WT mice. Thus 
our findings directly contradict the common assumption that mech- 
anical restriction, resulting from the reduced stomach size, is sufficient 
to mediate the therapeutic effects of VSG. 
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Figure 5 | VSG and FXR alter the composition of caecal microbial 
communities. a, Principal coordinates analysis plot of weighted UniFrac 
distances. Each dot represents a caecal community with the various surgical/ 
genotype groups indicated by colour. The percentage of variation explained by 
each principal coordinate (PC1, PC2, PC3) is shown in parentheses. 

b, c, Abundance gradient of Bacteroidetes (b) and Firmicutes (c). Each dot is in 
the same position as it was in a but the colour now indicates the relative 
abundance (blue indicating low abundance to red indicating high abundance) 
of Bacteroidetes and Firmicutes respectively. d, Relative abundance of 
Bacteroides was reduced in WT-VSG compared to WT-sham operated mice, 


In agreement with this, FXR-signalling is implicated in the regula- 
tion of lipid and glucose metabolism*'™. Initial reports emphasized a 
protective role, concluding that FXR signalling is necessary for normal 
glucose homeostasis. FXR KO mice exhibit impaired glucose and insu- 
lin tolerance when maintained on a chow diet*’, despite modestly ele- 
vated plasma bile acids. In apparent contrast, however, they are resistant 
to HFD-induced obesity and glucose intolerance’*. Intriguingly, two 
studies investigating the effect of the synthetic FXR agonist GW4064 on 
the development of HFD-induced obesity and glucose intolerance***° 
yielded opposite results. Whereas Watanabe et al.*° report greater sen- 
sitivity to diet-induced metabolic dysfunction, Ma et al.* report res- 
istance to diet-induced obesity following treatment with GW4064. A 
major difference between the two studies was route of administration. 
Watanabe et al. delivered the drug orally via the diet, whereas Ma et al. 
delivered it intraperitoneally. FXR is expressed in multiple organs import- 
ant for the regulation of metabolism, including liver, adipose tissue and 
intestine, and its activation in these compartments may elicit opposing 
effects on metabolism, perhaps contributing to this discrepancy. Rele- 
vant to this, bile acids may also affect metabolism by signalling via the 
G-protein-coupled receptor TGRS, raising the possibility of cross-talk 
between these systems”. Finally, we note these contradictory phenotypes 
have likewise been observed in the complex regulation of gene expression 
by other members of the nuclear receptor family. The PPARy haploin- 
sufficient mouse, for example, is more insulin sensitive despite the fact 
that PPARy agonists are well-known for their insulin-sensitizing effects*. 
Like PPARy, FXR and other nuclear receptors may either activate or 
repress the transcription of target genes, depending on the presence of 
co-activator and co-repressor complexes. Therefore, loss-of-FXR func- 
tion might paradoxically be associated with a relative increased expres- 
sion in a subset of FXR-regulated genes, while still making the animal 


but did not differ in KO mice. e, Relative abundance of one genus in the 
Porphyromonadaceae family was increased in WT-VSG compared to 
WT-sham operated mice, but was reduced in KO-VSG mice relative to 
KO-sham operated controls. f, Relative abundance of Roseburia was increased 
in WT-VSG compared to WT-sham operated mice, but did not differ in KO 
mice; individual data points show outliers. g, Among WT-VSG mice, the 
relative abundance of Roseburia was tightly correlated with fasting blood 
glucose. Data are presented as Tukey box-plots. *P < 0.05, **P < 0.01, 

***D < 0,001. n = 12 WT-sham, 7 WT-VSG, 9 KO-sham, 8 KO-VSG. 


resistant to treatments that activate FXR. This may explain the present 
conundrum that FXR KO mice are simultaneously resistant to some of 
the deleterious metabolic effects of consuming a HFD and also relatively 
unresponsive to the metabolic benefits of VSG. 

Bariatric surgical procedures, such as VSG, are currently the most effec- 
tive therapy for the treatment of obesity and are moreover associated 
with substantial improvements in co-morbidities, including type-2 dia- 
betes mellitus. The underlying molecular mechanisms contributing to 
these benefits remain largely undetermined, despite offering tremend- 
ous potential to reveal new targets for therapeutic intervention. Progress 
towards this goal has probably been hindered by a widespread belief 
that surgery exerts its effects simply by making it physically difficult to 
consume or absorb calories. The current work provides an alternative 
framework—that is, it identifies FXR-signalling as a molecular mech- 
anism necessary for both key changes to gut microbial communities 
and many associated metabolic benefits of VSG. These findings provide 
a new understanding of the mechanisms that underlie the benefits of 
bariatric surgery and suggest new targets for developing less-invasive 
therapeutic interventions. 


METHODS SUMMARY 

RNA-seq and unbiased pathway analysis. These were performed using RNA 
collected from the distal small intestine of VSG and sham-operated, pair-fed male 
(SPF) mice that were the subjects of a previously published study’. 

Animals. Singly-housed FXR KO mice and their WT littermates” were allowed 
ad libitum access to a high-fat diet (HFD) and water unless otherwise noted. VSG 
or sham surgery was performed as described elsewhere’. 

Microbiota. Caecal microbiota composition was analysed by pyrosequencing of 
the V1-V2 region from the bacterial 16S rRNA gene. PCR amplicons were gen- 
erated from caecal DNA. Sequence data were analysed by using the QUIME soft- 
ware package (version 1.5.0)°. 
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METHODS 


Protocols. The Institutional Animal Care and Use Committees at the University 
of Cincinnati and Cincinnati Children’s Hospital Medical Center approved all 
animal protocols. 

RNA-seq and unbiased pathway analysis. For RNA-seq followed by unbiased 
pathway analysis, the distal 1.5 cm of small intestine was collected from 5 VSG and 
5 sham-operated, pair-fed male (SPF) C57B1/6J mice (Jackson Labs, BarHarbour, 
ME) that were the subjects of a previously published study’. See ref. 9 for further 
methodological detail. RNA-seq libraries prepared with the Illumina TruSeq RNA 
preparation kit were sequencing on an Illumina Hi-Seq 2000, with single-end 50 bp 
reads. Analysis was performed in GeneSpring NGS (Agilent Technologies, Santa 
Clara, CA), where sequences were aligned to the mm9 reference mouse genome 
and assigned Ensembl annotations. Aligned gene reads that passed quality controls 
(quality threshold =30, zero Ns allowed, no multiply mapping reads) were quan- 
tified and used to calculated reads per kilobase per million (RPKM). Raw data were 
normalized using the DeSeq algorithm where reads were thresholded to a min- 
imum read count of 1. An additional filter required at least 10 reads per entity in 
each sample of either VSG or SPF samples (N = 14,591 entities). Differentially expressed 
genes with a fold change of =1.5 between VSG and SPF animals (N = 882) were 
submitted to pathway analysis, which assessed significance of overlap between the 
882 differential genes and those genes in pathways curated by Wikipathways, BioCyc, 
BioPax and Legacy, with a P-value cut-off of 0.05. 

Surgery. VSG surgery was performed in DIO mice using isofluorane anaesthesia. 
The lateral 80% of the stomach was excised leaving a tubular gastric remnant in 
continuity with the oesophagus superiorly and the pylorus and duodenum infer- 
iorly. The sham procedure involved analogous isolation of the stomach followed 
by manually applying pressure with blunt forceps along a vertical line between the 
oesophageal sphincter and the pylorus. Mice consumed liquid diet (Osmolite OneCal) 
for the first 3 post-operative days, and were re-introduced to HFD on day 3. 
Mice. 10-12-week-old male FXR KO mice and their WT littermates, described 
elsewhere” and bred from founders acquired from the Jackson Labs (Bar Harbour, 
ME), were singly housed in an AAALAC-approved facility with a 12-12 light-dark 
cycle, and allowed ad libitum access to HFD (D12492, Research Diets, New Brunswick, 
NJ) and water unless otherwise noted. After 10 weeks of HF-feeding, male mice 
were randomly divided into body weight and fat-matched groups within genotype, 
and we performed VSG or sham surgery as above. Based on our power analyses, 
completed using previous data to determine the expected variance and effect sizes, 
and based on our historical data regarding post-operative mortality, we performed 
sham surgery on 13 WT and 9 KO mice and VSG surgery on 14 WT and 15 KO 
mice. 5 KO-VSG and 4 WT-VSG mice died from complications of surgery and 
were not included in any of the analyses. Food intake and body weight was mon- 
itored weekly. Body composition was measured 1 week before and 11 weeks fol- 
lowing surgery by magnetic resonance imaging (Echo Medical Systems, Houston, 
TX). An additional 1 WT-sham, 2 KO-VSG and 2 WT-VSG mice died (or were 
found moribund) during the period of metabolic phenotyping that occurred between 
weeks 8 and 15. For any given analysis, only mice that are represented throughout 
the entire timecourse were included. In addition, for the macronutrient preference 
study, data from several individuals were excluded based on pre-established criteria— 
in this case, that the mice consistently spilled the powdered macronutrient diets on 
the cage floor making accurate measurement of what was ingested impossible. For 
the microbiota analyses, one WT-VSG individual was excluded due to a noted 
investigator error while collecting the caecal samples at sacrifice. Investigators 
conducting the gene expression and microbiota analyses were blind to the experi- 
mental groups. 

Macronutrient preference. Food choice was assayed during postoperative week 
10 using a macronutrient selection paradigm, in which 3 pure macronutrient diets 
(Harlan Teklad; TD.02521 (carbohydrate), TD.02522 (fat) and TD02523 (protein)) 
were presented simultaneously in separate containers for 6 days (n = 12 WT-sham, 
6 WT-VSG, 8 KO-sham, 7 KO-VSG). 

Glucose tolerance tests. Glucose tolerance was assayed during postoperative week 
8 in 5-h fasted mice. Blood glucose was measured with AccuCheck meters and 
strips (Roche, Indianapolis, IN) from the tip of the tail vein in unrestrained animals 
at baseline, and at regular intervals following an i.p. injection of 1 gkg ' body 
weight dextrose (n = 13 WT-sham, 10 WT-VSG, 9 KO-sham, 10 KO-VSG). 
Analysis of gut microbial communities. Whereas much of the microbiota is 
dead in the faeces, the caecum is an active bioreactor with high metabolic ferment- 
ative capacity. Thus we focused our study on the caecal microbiota. Caecal micro- 
biota composition was analysed by pyrosequencing of the V1-V2 region from the 
bacterial 16S rRNA gene. PCR amplicons were generated from caecal DNA obtained 
from WT and FXR KO mice that underwent either VSG or sham operations (n = 12 
WT-sham, 7 WT-VSG, 9 KO-sham, 8 KO-VSG). Sequence data were analysed by 
using the QIIME software package (version 1.5.0)°°. 
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Sample processing, extraction of genomic DNA and amplification for 454 
pyrosequencing. Aliquots of caecal material were collected and immediately fro- 
zen in liquid nitrogen. These aliquots were stored at —80 °C until extracted: approxi- 
mately half of each aliquot was used for isolation of genomic DNA and the other 
half was used for extraction of SCFAs and microbial metabolites. 

Genomic DNA was isolated from 50-80 mg of caecal content using the repeated 

bead beating (RBB) method previously described by Salonen et al.*'. The V1-V2 
region of the 16 rRNA gene was amplified using the 27F and 338R primers fused 
with 454 Titanium sequencing adapters. 338R primers contained unique error- 
correcting 12-base barcodes that allow to tag PCR products from different samples”. 
Each sample was amplified in triplicate in a reaction volume of 25 pl containing 
1.5 U of FastStart Taq DNA Polymerase (Roche), 0.2 1M of each primer and 10- 
20 ng of genomic DNA. PCR was carried out under the following conditions: initial 
denaturation for 3 min at 95 °C, followed by 25 cycles of denaturation for 20s at 
95°C, annealing for 20s at 52°C and elongation for 60s at 72°C, and a final 
elongation step for 8 min at 72 °C. Triplicates were combined, purified with the 
NucleoSpin Gel and PCR Clean-up kit (Macherey-Nagel, Germany) and then quan- 
tified using the Quant-iT PicoGreen dsDNA kit (Invitrogen, Carlsbad, CA). Purified 
PCR products were diluted to a concentration of 20 ng pl”! and pooled in equal 
amounts. The pooled amplicons were purified again with the Ampure magnetic 
purification beads (Agencourt, Danvers, MA) to remove short amplification pro- 
ducts. Sequencing was performed using 454 GS FLX titanium chemistry at GATC 
Biotech (Konstanz, Germany). 
Analysis of 16S rRNA gene sequences. Raw data were quality filtered to remove 
sequences that were shorter than 200 nucleotides, longer than 1,000 nucleotides, 
contained primer mismatches, ambiguous bases, uncorrectable barcodes, or homo- 
polymer runs in excess of six bases. Quality filtered reads were trimmed of 454 
adaptor and barcode sequences, and were analysed with the software package 
Quantitative Insights Into Microbial Ecology (QIIME)” (version 1.5.0). A total 
of 376,965 sequences were obtained for 36 samples; sequences were demultiplexed 
and an average of 10,471 sequences were attributed to each sample (range: 7,208- 
12,522 sequences). 

Sequences were assigned to operational taxonomic units (OTUs) using UCLUST 

with a 97% threshold of pairwise identity. The most abundant sequence was picked 
as representative for each OTU and was given taxonomic assignment using the 
Ribosomal Database Project (RDP) Classifier*. Representative OTUs were aligned 
using Pynast* and used to build a phylogenetic tree with FastTree™*, which was 
used to estimate B-diversity of samples (weighted UniFrac**). Sequences were checked 
for chimaeras using ChimeraSlayer and chimaeric sequences were excluded from 
all downstream analyses. Similarly, singletons and sequences that could not be 
aligned with Pynast were also excluded. 
Extraction of microbial metabolites (SCFAs and other organic acids). GC-MS 
was used for measurement of organic acids in mouse caecal samples (n = 12 WT- 
sham, 7 WT-VSG, 9 KO-sham, 8 KO-VSG). After extraction with diethyl ether, 
an aliquot of each sample was derivatized with N-tert-butyldimethylsilyl-N- 
methyltrifluoracetamide (MTBSTFA; Sigma), and the organic acids were quan- 
tified by using a gas chromatograph (Model 7890 A, Agilent Technologies) coupled 
to a mass spectrometer detector (Model 5975 C, inert XL MSD with triple Axis 
Detector; Agilent Technologies). 50-80 mg of frozen caecal contents were trans- 
ferred to glass tubes (16 X 125 mm) fitted with a screw cap, and a volume of 100 pl 
of internal standards stock solution ([{1-'*C]acetate and [*H.] propionate 1 M, 
['°Cy]butyrate 0.5 M, [1-'°C, Jisobutyrate and [1-'*C]isovalerate 0.1 M, [1,2-'°C)] 
hexanoate, [‘*C]lactate and ['°C,]succinic acid each at 40 mM) was added to 
the tubes. Prior to extraction samples were freeze-dried at —50°C for 3h (yield 
8-15 mg dry weight). After acidification with 50 ul of 37% HCl, the organic acids 
were extracted twice in 2 ml of diethyl ether. A 500 pl aliquot of the extracted 
sample was mixed together with 50 jl of MTBSTFA (Sigma) at room temperature. 
An aliquot (1 pl) of the resulting derivatized material was injected into a gas chro- 
matograph (Agilent Technologies 7890 A) coupled to a mass spectrometer detector 
(Agilent Technologies 5975 C). A linear temperature gradient was used: the initial 
temperature of 65 °C was held for 6 min, increased to 260 °C (15 °C min ') and 
then to 280 °C for 5 min. The injector and transfer line temperatures were 250 °C. 
Quantitation was completed in selected ion monitoring acquisition mode by com- 
parison to labelled internal standards (valerate was compared to [1-'*C]isovalerate, 
heptanoate and octanoate were compared to [1,2-'*C,]hexanoate and fumarate was 
compared to ['°Cy]succinic acid). The m/z ratios of monitored ions were as fol- 
lows: 117 (acetic acid), 131 (propionic acid), 145 (butyric acid), 146 (isovaleric acid), 
159 (isovaleric acid and valeric acid, 173 (hexanoic acid), 187 (heptanoic acid), 201 
(octanoic acid), 261 (lactic acid), 287 (fumaric acid), 289 (succinic acid), 121 ([?H»]- 
and [1-'*C]acetate), 136 ((?Hs]propionate), 146 ([1-'C; Jisobutyrate), 149 ([!3Ca] 
butyrate), 160 ({1-'8Cl]isovalerate), 175 ([1,2-'°C,]hexanoate), 264 ({'*C]lactate) 
and 293 ([{1°C,4]succinic acid). 
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Statistical analyses. Data are presented as mean + standard errors or as Tukey 
box-plots, as noted. Data were analysed using the appropriate ANOVA or repeated 
measures ANOVA (RM ANOVA), followed by Tukey’s post-hoc tests, or by t-test 
as indicated. The data were tested for normality and homogeneity of variance. In 
cases where the data failed these assumptions, the analyses were also done after 
rank transformation. In no cases did this alter the statistical significance or inter- 
pretation of the findings, so the statistics reported in the figures are from the non- 
transformed analyses. Data were analysed using Prism (Graph Pad, San Diego CA) 
or Sigma Stat (SYSTAT, San Jose CA) software with the critical value, «, set at 
P<0.05. 
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Extended Data Figure 1 | Body weight and body fat in a weight-matched 
subset of WT and FXR KO mice. These post-hoc analyses include the lightest 
9 WT and heaviest 8 FXR-KO mice before surgery, creating 4 well-matched 
groups. a-d, In this subset, WT-VSG lose weight relative to WT-sham controls, 
and maintain this weight loss for 14 weeks (a), whereas KO-VSG mice lose 
weight initially, but recover to match the weight of KO-sham controls within 
4-5 weeks (b). Consequently, by week 8 KO-VSG mice were heavier than 
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WT-VSG mice (c). Likewise, these groups were well-matched for pre-surgical 
body fat (d). At 11 weeks after surgery, WI-VSG mice had significantly less 
body fat compared to both WT-sham controls and KO-VSG mice. KO-sham 
and KO-VSG mice had equivalent adiposity (e). Data are shown as mean + s.e. 
*P < 0.05, **P < 0.01, ***P < 0.001. For all panels n = 5 WT-sham, 

4 WT-VSG, 4 KO-sham, 4 KO-VSG. 
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Extended Data Figure 2 | Glucose tolerance in WT-VSG and KO-VSG mice. _ directly, KO-VSG mice exhibit significantly impaired glucose clearance at both 
When the glucose excursion of WT-VSG and KO-VSG mice are compared 30 and 60 min. Data are shown as mean = s.e. *P < 0.05. n = 10 per group. 
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Extended Data Figure 3 | Effect of genotype and VSG on distribution along 
PC1. Among WT mice, sham and VSG mice separate significantly along PC1. 
In contrast, among KO mice there is no significant difference between sham 
and VSG. Data are shown as mean + s.e. ***P < 0.001. n = 12 WT-sham, 

7 WT-VSG, 9 KO-sham, 8 KO-VSG. 
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Extended Data Figure 4 | Relative abundance of Bacteroides, an 
uncharacterized genus in Porphyromonadaceae, and Roseburia correlated 
with metabolic parameters. a-d, The relative abundance of Bacteroides was 
significantly correlated with change in body weight (a), change in body fat 
(b), and the area under the curve (AUC) in the glucose tolerance test (d), 
but not with fasting blood glucose (c). e-h, The relative abundance of an 
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uncharacterized genus in Porphyromonadaceae was significantly correlated 
with change in body weight (e), fasting blood glucose (g) and AUC in the 
glucose tolerance test (h), but not with change in body fat (f). i-l, The relative 
abundance of Roseburia was significantly correlated with change in body 
weight (i), change in body fat (j), fasting blood glucose (k) and AUC in the 
glucose tolerance test (1). n = 36. 
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Extended Data Figure 5 | Effect of genotype and VSG on the relative (a), Lactococcus (b) and Escherichia/Shigella (c) that did not vary according 
abundance of Lactobacillus, Lactococcus and Escherichia. a-—c, VSG was to genotype. Data are presented as Tukey box-plots. n = 12 WT-sham, 


associated with a significant increase in the relative abundance of Lactobacillus 7 WT-VSG, 9 KO-sham, 8 KO-VSG. 
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Extended Data Figure 6 | VSG alters the abundance of caecal SCFAs. d, The acetate:butyrate ratio is increased following VSG. Data are presented as 
a-c, The relative concentration of butyrate (a) and propionate (b), but not Tukey box-plots. *P < 0.05, **P < 0.01. Also see Extended Data Table 1. 


acetate (c), was altered by VSG, and this did not differ depending on genotype. = 12 WT-sham, 7 WT-VSG, 9 KO-sham, 8 KO-VSG. 
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Extended Data Table 1 | The effect of VSG on caecal organic acids 


Organic acid 
Acetate (umol/g) 
Propionate (umol/g) 
Isobutyrate (umol/g) 
Butyrate (umol/g) 
lsovalerate (umol/g) 
Valerate (umol/g) 
Hexanoate (mol/g) 
Hepthanoate (mol/g) 
Octanoste (umol/g) 
Lactate (umol/g) 
Succinate (umol/g) 
Fumarate (umovg) 


Total SCFAs (C2-C4) 


Breakdown of the caecal organic acids (mol per g dry weight) in both WT and KO mice after either sham or VSG surgery. 


WT-Sham 


142.33 = 20.45 


24.80 = 3.28 


9.56 21.71 


19.20 = 3.42 


6.2520.99 


2.50 20.41 


1.14 20.37 


0.0080 = 0.0027 


0.013 = 0.0056 


13.63 = 14.80 


4.29 = 2.01 


0.50 = 0.23 


186.33 = 26.50 


WT-VSG 


153.46 = 26.77 


19.60 = 4.29 


10.41 22.24 


24.75 24.48 


7.02 21.3 


2.77 = 0.54 


1.73 = 0.49 


0.0088 = 0.0027 


0.027 = 0.0073 


75.76 = 19.38 


4.55 22.63 


0.82 0.30 


197.82 = 34.70 


KO-Sham 


124.54 = 23.61 


15.14 23.78 


9.112198 


17.55 23.95 


5.78 21.14 


2.19 £0.41 


0.79 = 0.43 


0.0032 = 0.0032 


0.0073 = 0.0064 


6.95 = 17.10 


5.17 £2.32 


0.91 £0.27 


157.23 = 30.60 


KO-VSG 


160.86 = 25.04 


20.32 = 4.01 


10.09 = 2.09 


29.79 = 4.19 


7.08 = 1.21 


2.9920.41 


1.74 = 0.46 


0.0088 = 0.0038 


0.019 = 0.0068 


38.20 = 18.13 


8.81 = 2.46 


1.16 = 0.28 


210.99 = 32.46 
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Astrocyte-encoded positional cues 
maintain sensorimotor circuit integrity 


Anna V. Molofsky'””, Kevin W. Kelley!”*>°*, Hui-Hsin Tsai!'*"**, Stephanie A. Redmond®’, Sandra M. Chang'”, 
Lohith Madireddy’, Jonah R. Chan’, Sergio E. Baranzini’, Erik M. Ullian® & David H. Rowitch?*+? 


Astrocytes, the most abundant cells in the central nervous system, promote synapse formation and help to refine neural 
connectivity. Although they are allocated to spatially distinct regional domains during development, it is unknown whether 
region-restricted astrocytes are functionally heterogeneous. Here we show that postnatal spinal cord astrocytes express 
several region-specific genes, and that ventral astrocyte-encoded semaphorin 3a (Sema3a) is required for proper motor 
neuron and sensory neuron circuit organization. Loss of astrocyte-encoded Sema3a leads to dysregulated o-motor neuron 
axon initial segment orientation, markedly abnormal synaptic inputs, and selective death of a- but not of adjacent y-motor 
neurons. In addition, a subset of TrkA* sensory afferents projects to ectopic ventral positions. These findings demonstrate 
that stable maintenance of a positional cue by developing astrocytes influences multiple aspects of sensorimotor circuit 
formation. More generally, they suggest that regional astrocyte heterogeneity may help to coordinate postnatal neural 


circuit refinement. 


Developing neural circuits must form and maintain appropriate regional 
connections in a rapidly expanding central nervous system (CNS). 
Although astrocytes (AS) are increasingly recognized as general reg- 
ulators of synapse formation’, little is known about whether they 
encode heterogeneous positional signals involved in local neural circuit 
formation and/or maintenance. Recent studies indicate that AS develop 
and are regionally allocated in murine brain and spinal cord (SC) accord- 
ing to an embryonic segmental template**. AS derived from embryonic 
radial glia” migrate in the trajectory of these fibres and proliferate 
locally®’, yielding clonally related populations”* that retain spatial 
restriction into adulthood. 

Here we tested whether AS allocated to discrete dorsal—ventral 
(DV) SC domains might be functionally adapted to support specific 
neural circuits and neuronal subtypes’. The SC sensorimotor circuit 
has well-defined organization in the DV axis (Fig. 1a). The ventral horn 
contains two types of motor neurons (MN), called a-MN and y-MN, 
whose axons exit the ventral root to project to extrafusal («) and intra- 
fusal (y) muscle fibres’®. During development, afferent sensory fibres 
entering from the dorsal root ganglion (DRG) include type 1a prop- 
rioceptive afferents that synapse directly on ventral «-MN, and TrkA* 
sensory axons that synapse in the dorsal grey matter’’. 

Although programs that control MN diversification and connectivity 
are well established”’”, comparatively little is known about non-neuronal 
signals that influence local circuit formation'’’’. We report that ventral 
AS-encoded semaphorin 3a (Sema3a), a secreted molecule that signals 
through plexin A/neuropilin 1 receptor (Nrp1) complexes'*"*, has crit- 
ical roles in orienting MN axon initial segments (AIS), synapse regu- 
lation, MN subtype survival and normal patterning of a subset of TrkA* 
sensory neurons. These findings establish a discrete molecular function 
for region-restricted AS. 


qPCR dorsal/ventral astrocytes 


iS IS 
SOA OH SF 
SRE 


sv 


a Neurons Astrocytes 


10 


Sensory Dorsal om 
5a 
a. 
a 
G 
2 D SAMA 
y 2a LEE SRLS 
Motor ‘<== Ventral = 
Be J Sema3a => | 
c qPCR Sema3a d Relative expression (genechip) 
30 42 MmDas 
= Ca = MV-AS 
89 aii i oe 54 14 [ii D-non-as 
ge ss 400 [ 44 CV-non-AS 
5S 4 35 i 
os 8 = 200 
oO 2 £5 
ut ag 
0 ge o 
oe »? 4? ras Sema3a Thbs2 Gpc4 Gpc6 
GS Astrocyte-secreted genes 
e VH VH inset VH + block Dorsal horn f g Sema3a antibody 
ee g 3° 
<a £ © 40 
e& E E 
of oy oy 
mo} an ao 
eo E ~ 20 
EE 2 8 
BS 2 5 0 
= te QO =Med VH Lat DH 
(O} 


Spinal cord region 


Figure 1 | AS express region-specific genes. a, Concept of DV AS functional 
specialization. b, qPCR validation of differentially expressed genes in 
Aldh1l1-GFP* dorsal or ventral SC AS (2 out of 4 samples had undetectable 
dorsal Sema3a). c, PCR of Aldh111-GFP AS versus non-AS (purity >95%). 
d, Regional expression of Sema3a versus thrombospondin 2 (Thbs2), and 
glypicans 4/6 (Gpc4/Gpc6). D, dorsal; V, ventral. e, Sema3a proteins in ventral 
but not dorsal SC Aldh1l1-GFP* protoplasmic AS (arrows, inset). ‘Block’ 
denotes antibody blocking peptide. DAPI, 4',6-diamidino-2-phenylindole. 
VH, ventral horn. Scale bar, 20 jum. f, Graded expression of Sema3a protein as 
percentage of Sema3a‘/Aldh111-GEP* AS at E18.5. DH, dorsal horn; Med, 
medial; lat, lateral. All data are mean + s.e.m. f-actin is used as a housekeeping 
gene in qPCR. n = 4 (b) or 3 (c-f) biological replicates per bar. 
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AS express DV-restricted positional cues 


To identify regionally distinct molecular differences we purified fib- 
rous and protoplasmic AS from microdissected dorsal and ventral 
postnatal day 7 (P7) SC by flow cytometry using the Aldh1l1-GFP 
transgene reporter~”’”"* (Fig. la and Extended Data Fig. 1). Gene 
expression profiling and bioinformatic analysis identified 38 genes that 
were differentially expressed (Extended Data Fig. 1), and we validated 
these results by quantitative PCR. As shown (Fig. 1b), several genes 
encoded extracellular matrix molecules’® (Hapln1, Pamr1, Enpep, Bgn) 
or factors with positional roles in brain development, including reelin 
(Reln)” and Eph receptor A5 (Epha5; ref. 21). Of these, Sema3a was the 
most highly expressed ventral AS-specific gene, showing over threefold 
higher levels in radial glia and AS (versus non-AS) from embryonic 
day 13.5 (E13.5)-P7 (Fig. 1c), consistent with Sema3a expression in situ 
(Extended Data Fig. 2a—c)’°. In contrast to Sema3a, genes for other AS- 
secreted molecules, including thrombospondin 2 (Thbs2) and glypican 
4 and 6 (Gpc4, Gpc6), were expressed without positional distinction 
(Fig. 1d). Sema3b, -c, -d, -e and -f were low or undetectable in AS (data 
not shown). 

AS Sema3a protein expression showed graded expression, with lowest 
numbers of Sema3a° cells in the dorsal horn and highest numbers in 
the ventral horn. In ventral AS, Sema3a proteins appeared oriented 
towards MN soma (Fig. le, f). Although Sema3a messenger RNA trans- 
cripts were highly expressed in «-MN, we did not detect corresponding 
Sema3a protein levels (Fig. le and Extended Data Fig. 2e). MN express 
the obligate semaphorin receptor Nrp1 (refs 15, 22, 23) (Extended Data 
Fig. 2f). In addition, DRG TrkA* sensory neurons, but not parvalbumin 
(PV)* proprioceptive afferents, express high levels of Nrp1 (Extended 
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Figure 2 | Axon-repulsive effects of AS-encoded Sema3a maintain a-MN 
AIS orientation. a, P7 lumbar SC ChAT/NeuN" o-MNs with AnkG* AIS. 
Ventral root (*) and orientation vector towards it for selected a-MN (white 
arrows). Scale bar, 100 jim. b, Insets of yellow boxed areas in a. Yellow arrow 
denotes AnkG* AIS junction, the angle between these vectors determines AIS 
orientation. c, Overlay of all lumbar AIS orientation angles demonstrates 
that misoriented MN are topographically distributed. d, Mean angle and 
variability is significantly increased in P7 «-MN in absence of AS-encoded 
Sema3a (0° = towards ventral root). e, No difference in AIS orientation in 
y-MN. f, Scatter plot of data generated as in c shows no misorientation of «- or 
-MN by P30. g, AIS angles in AldhIl1cre:Sema3a™ are normal at E14.5 
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Data Fig. 2f-h). These findings suggested potential neuronal subtype- 
specific functions for AS-encoded Sema3a. 


AS Sema3a restricts o-MN AIS orientation 


We first investigated ventral astrocytic Sema3a function during early 
postnatal MN development. To conditionally target Sema3a in AS, we 
crossed Sema3a™' mice“ to hGFAPcre mice”’, chosen because: (1) Cre 
activity commences in late-stage glial progenitors; (2) it targets proto- 
plasmic AS; and (3) Cre activity segregates from SC MN” and inter- 
neurons (Extended Data Fig. 3a). hGFAPcre:Sema3a™" animals survived 
postnatally in near-normal numbers. 

Previous work indicates that Sema3a has roles in supporting dend- 
rite versus axon identity” and hippocampal neuron axon repulsion 
and dendrite growth in vitro”*. To assess a potential role for AS-encoded 
Sema3a in orienting MN axons in vivo we used ankyrin G (AnkG)” to 
define the direction of the AIS relative to the ventral root. By P7, large 
a-MN and smaller y-MN normally exist in a 2:1 ratio in most MN 
pools’®. To determine AIS orientation of both «-MN and y-MN follow- 
ing loss of AS-encoded Sema3a, we used AnkG to identify the AIS 
and NeuN staining to distinguish «%-MN (choline acetyltransferase 
(ChAT)* NeuN*) from y-MN (ChAT* NeuN7 )!°. This analysis was 
performed at both lumbar and cervical levels and the results were plotted 
on a positional grid (Fig. 2a—c and Extended Data Fig. 4). In control 
animals, we found that x-MN AIS were oriented an average of 39 + 33° 
(s.d.) from the ventral root. By contrast, we observed marked disorga- 
nization of a-MN AIS orientation in hGFAPcre:Sema3a“" animals, 
with increases in both mean angle and variability (62 + 46° (s.d.); 
P<0.001) (Fig. 2d and Extended Data Table 1) at all topographic 
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positions (Fig. 2c and Extended Data Fig. 4). Notably, we did not detect 
any differences in AIS orientation in y-MN (control 52 + 31° versus 
mutant 49 + 29°; P=not significant; Fig. 2e). Furthermore, these 
prominent differences at P7 were no longer detectable by P30 (Fig. 2f). 

To determine whether this AIS orientation defect reflected aberrant 
initial positioning or a failure to maintain MN orientation during later 
growth, we targeted radial glia and AS using AldhIl1cre, which targets 
gliogenic radial glia at E12.5-13.5 (ref. 7). AldhIlicre fate maps most 
AS, some oligodendrocytes (that are Semma3a negative, Extended Data 
Fig. 2d), but excludes SC MN and interneurons (Extended Data Fig. 3b). 
Aldhilicre:Sema3a™" mice were perinatal lethal (28% expected sur- 
vival at P1-P5, n = 112), but with normal numbers of AnkG* MN, 
suggesting that axon specification is not affected in vivo (AnkG* MN 
per confocal section: 42 + 5% versus 46 + 4%, P = not significant). 
At E14.5 (before the major period of AS expansion’), we found no 
evidence of abnormal MN cell body positioning (Extended Data 
Fig. 5) or MN AIS orientation in Aldh1licre:Sema3a™“" animals (con- 
trol 29 + 21° s.d. versus mutant 32 + 30°, P = not significant; Fig. 2g). 
By PO, AIS orientation defects were evident and similar in magnitude to 
those observed in P7 hGFAPcre:Sema3a™" (37 + 25° (s.d.) versus 
66 + 38° (s.d.); P< 0.001; Fig. 2g). As a control, we deleted Sema3a 
in MN using Hb9cre*®”". This did not lead to defects in MN AIS orien- 
tation (31 + 25° control, 30 + 33° Hb9cre:Sema3a™ , P= not signifi- 
cant; Fig. 2h). These findings demonstrate that early developmental 
events—including initial positioning of MN soma and exit of MN axons 
from the ventral root—occur normally despite loss of AS-encoded 
Sema3a. However, with loss of AS-encoded Sema3a, many «-MN fail 
to properly maintain AIS orientation towards the ventral root at early 
postnatal stages. 

To test whether AS-encoded Sema3a acts directly on MN with- 
out other cellular intermediates we co-cultured ventral SC AS from 
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Figure 3 | AS-encoded Sema3a is required for postnatal a-MN survival. 

a, b, Normal numbers of o-MN (Chat NeuN*) and y-MN (Chat* NeuN_ ) 
at P7 in hGFAPcre:Sema3a™ cervical and lumbar SC. Scale bar, 50 pum. 

c, d, Fewer large «-MN and preserved numbers of y-MN (Err3-bright, 
arrows.) in hGFAPcre:Sema3a™" animals at P28-P33. Scale bar, 50 tum. 

e, f, Dose-dependent decrease in MN soma area at P28-P33 with loss of 
AS-Sema3a; histograms show relative depletion of large MN (arrow). 

g, Representative peripherin*/Map2* embryonic rat MN cultured in 
factor-free media or with recombinant Sema3a. Red arrow denotes soma. Scale 
bar, 100 um. h, MN survival with recombinant Sema3a with/without 
Nrp1-blocking antibody. i, Summary. Statistics: mean + s.e.m. e, f, One-way 
mixed-effects analysis of variance (ANOVA) with Tukey’s multiple 
comparison. b, c, h, Student’s t-test. Data in b, ¢ from 4 per genotype from 4 
sections per animal; h average of 4 independent experiments. *P < 0.05, 
***ED < ().0001. 
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Sema3a™" animals at sub-confluent density with embryonic rat MN” 
(Fig. 2i), and added adenoviral Cre (Ade-Cre) to some wells to delete 
Sema3a. We found that the length of proximal segment overlap in MN 
adjacent to AS was significantly increased after Ade-Cre-mediated 
deletion of Sema3a from ventral AS (50 + 9 um versus 111 + 16 um, 
P<0.001; Fig. 2), k), suggesting a failure to properly repel the axon. 
Thus, AS-encoded Sema3a directly repels MN axons both in vivo and 
in vitro. 


AS Sema3a is needed for a-MN survival 


Given these findings, we investigated whether ventral AS-encoded 
Sema3a might also have later roles in maintenance of the sensori- 
motor circuit. At P7, numbers of both «- and y-MN were normal in 
hGFAPcre:Sema3a™" mice (Fig. 3a, b'° and data not shown’). By con- 
trast, by P30 we found a significant (P< 0.05) twofold reduction in 
numbers of surviving Err3-negative’’ «-MN in hGFAPcre:Sema3a™" 
mice (Fig. 3c, d). This was supported by a dose-dependent reduction 
in the average size of MN soma reflecting selective loss of a large MN 
population (Fig. 3e, fand Extended Data Table 1). By contrast, y-MN 
persisted in normal numbers. In addition, two ventral interneuron 
populations (Chx10* interneurons and calbindin* Renshaw cells) did 
not show Sema3a-dependent depletion. In fact, the number of Renshaw 
cells was significantly increased at P30 (Extended Data Fig. 6). 

To further assess direct effects of Sema3a on MN survival, we cultured 
MN in factor-free media for 24h (to allow for initial polarization”*) 
then added recombinant Sema3a proteins. We found that exogenous 
Sema3a promoted MN survival in a dose-dependent manner that was 
abrogated by preincubating with an Nrp1-blocking antibody (Fig. 3g, h). 
Interestingly, the MN that survived in the absence of Sema3a typically 
had bipolar morphology (no add 86 + 7% bipolar versus Sema3a 20 + 
11%, P < 0.01). Together, these findings indicate that AS-encoded Sema3a 
can directly promote MN survival (Fig. 3i) in a manner that is tightly 
linked to its tropic effects. Notably, abnormal MN AIS orientation was 
no longer evident in adult hGFAP-cre:Sema3a" mice (Fig. 2f), sug- 
gesting that misoriented «-MN are lost by adulthood. 


AS Sema3a regulates MN function 


To test whether MN in hGFAPcre:Sema3a™" mutants integrated nor- 
mally into local synaptic circuits (Fig. 4a), we first counted excitatory 
vGlut1* presynaptic puncta on MN soma, which reflect type 1a pro- 
prioceptive sensory afferents’. We observed a significant decrease 
in the number of vGlutl puncta per MN (Fig. 4b; 4.82 + 0.26 versus 
2.91 +0.2; P<0.0001) in adult mutant animals. These differences 
remained highly significant even when only large (>500 um) putative 
a-MN were counted (5.59+0.25 versus 4.13 + 0.23; P<0.001). 
Numbers of excitatory vGlut2 puncta, which are less dependent on 
DRG afferent input***, were not significantly different in hGFAP-cre: 
Sema3a"" animals versus controls (18.29 + 0.93 versus 16.11 + 0.88 
P = not significant; Extended Data Table 1). By contrast, we found sig- 
nificant increases in numbers of VGAT inhibitory presynaptic puncta 
with loss of AS-encoded Sema3a (Fig. 4b; 21 + 0.74 versus 27 + 0.96; 
P<0.0001). 

To test whether these synaptic changes correlated with changes in 
MN function, we performed whole-cell patch clamp electrophysiological 
recordings” of lumbar a-MN at P13-14 from hGFAPcre;Sema3a™: 
ChAT-GFP transgenic reporter mice and Cre-negative controls (Fig. 4c, d). 
As shown (Fig. 4f-i), loss of AS-encoded Sema3a conferred a large shift 
in the balance of excitation/inhibition with significantly decreased 
MN spontaneous excitatory postsynaptic current (sEPSC) frequency 
(control 3.19 + 0.35 Hz versus mutant 1.21 + 0.38 Hz; P<0.01) and 
increased spontaneous inhibitory postsynaptic current (sIPSC) frequency 
(control 0.14 + 0.04 Hz versus mutant 0.84 + 0.15 Hz; P< 0.01). Am- 
plitudes of sEPSC and sIPSC were not changed (Extended Data Fig. 7). 
Interestingly, hGFAPcre:Sema3a™" MN were hyperexcitable as reflected 
by a significant decrease in the rheobase value (control 90.0 + 23.0 pA 
versus mutant 37.5 + 7.2 pA, P< 0.05; Fig. 4d, e) with unchanged input 
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resistances (Extended Data Fig. 7), suggesting compensatory changes 
in MN function. Together, these findings demonstrate that loss of AS- 
encoded Sema3a leads to changes in MN excitatory and inhibitory synap 
tic inputs and has global effects on MN firing properties. 


AS regulate sensory axon targeting 


Initial studies of Sema3a demonstrated selective chemorepellent activity 
for sensory axons expressing TrkA or calcitonin gene-related peptide’*””*, 
which label overlapping sets of DRG neurons”. However, the cellular 
source of Sema3a in SC has never been defined. Sensory axon guidance 
takes place between E13.5 and E18.5 in the mouse SC. As shown (Fig. 1a), 
TrkA~ sensory axons normally synapse in the dorsal horn“’, whereas 
PV" proprioceptive 1a afferents synapse with ventral o-MN". 
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Figure 5 | AS-encoded Sema3a regulates DV positioning of sensory axons. 
a, Dil labelling of upper thoracic SC demonstrates ectopic ventral fibres in 
Aldhilicre:Sema3a""' mice (inset, box, arrows) (n = 7 per group). Scale bar, 
100 tum. b, No ectopic proprioceptive la afferents (white arrow) in 
Aldhilicre:Sema3a"" mice; n = 4 per group. Scale bar, 100 jum. c, Multiple 
ectopic ventral TrkA~ projections in AldhIl1cre:Sema3a™" mice (box, insets). 
Scale bar, 100 jim. d, Overlay dot-plot of TrkA* terminations in Cre-negative 
controls (black circles) and AldhIl1cre:Sema3a™" (red stars); n = 3 per 
genotype. e, Quantification of DV termination index“ shows a significant 
increase in ventral terminations in Aldh 111 cre:Sema3a™ mice. f, Medial versus 
lateral terminations unchanged. g, Summary of sensory axon phenotypes. 
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We characterized sensory axon guidance in AldhIlIcre:Sema3a“" 
animals to determine potential AS-encoded functions (Fig. 5). Notably, 
whereas AldhIllcre activity fate maps to ~5% of DRG cells, Sema3a 
expression was undetectable in the DRG’* (Extended Data Fig. 2b), and 
we observed normal numbers of both PV~ and TrkA* cells in the DRG 
of AldhIl1cre:Sema3a™" mice (Extended Data Fig. 8). Dil labelling of 
DRGs of Aldh Il cre:Sema3a™" mutants at E18.5-P0 showed abnormal 
ventrally positioned axons in 7 out of 7 mutants examined versus 0 out 
of 7 controls (Fig. 5a). 

We then investigated the sub-classes of DRG sensory afferents affec- 
ted in Sema3a mutants. We observed normal DV positioning of PV~ 
type la proprioceptive afferents albeit with subtle fasciculation defects 
(Fig. 5b). By contrast, TrkA* afferents showed numerous abnormal 


SC astros + DRG neurons 


Control .Sema3a~~ AS 
a : e 
TrkAt i 
Pvt ! 


+Ade-Cre 


+ Ade-Cre 


-Virus 


Dorsal 
to 
x 


nD 


2 


Complexity 


| 
425 
Distance from soma (uM) 


Length (mM) 


f=} 


0 
25 


Ventral 


-Virus +Ade-Cre 225 425 


h, Diagram of AS-sensory neuron co-culture protocol. i, Representative TrkA* 
DRG neuron grown on SC AS. Scale bar, 50 um. j, Tracings of wild-type DRG 
neurons grown on dorsal/ventral SC AS from Sema3a™' mice, under wild-type 
(no virus) or Sema3a /~ (+Ade-Cre) conditions. Scale bar, 100 jum. 

k, Reduced total neurite length of neurons grown on ventral (red) vs dorsal 
(blue) AS significantly rescued on Sema3a ‘~ ventral AS. 1, m, Sholl analysis 
shows significantly less branching on ventral AS (1), rescued withSema3a 
deletion (m). Statistics: culture data from 4-6 independent experiments with 
>10 neurons per condition per experiment. Mean + s.e.m; *P < 0.05, 

**D < 0.01, ***P < 0.001, Student’s t-test. 
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ventral terminations in Aldh1licre:Sema3a™" mutants (Fig. 5c-f). Thus, 
AS-encoded Sema3a regulates DV patterning of sensory axon projec- 
tions in a subtype-specific manner in vivo (Fig. 5g), consistent with 
previous results*® and with the neuronal pattern of Nrp1 expression 
(Extended Data Fig. 2). 

To test whether distinctions between regional SC AS exist inde- 
pendent of environmental positional cues, we co-cultured dorsal or 
ventral AS with dissociated DRG sensory neurons (Fig. 5h) using AS 
from Sema3a™" animals and deleted Sema3a by adding Ade-Cre. 
Interestingly, AS cultures retained their distinct regional expression 
characteristics for many of the genes prospectively identified in vivo 
(see Fig. 1b and Extended Data Fig. 9), including higher ventral 
Sema3a levels. 

We observed that both dorsal and ventral AS cultures preferentially 
supported the survival of TrkA-positive sensory neurons (>85% of 
neurons were TrkA*, none were PV" Fig. 5i), with no significant sur- 
vival differences between dorsal and ventral cultures. However, ven- 
tral SC AS cultures significantly (P < 0.01) inhibited neurite outgrowth 
and complexity relative to dorsal AS (Fig. 5j-m). Ade-Cre deletion of 
Sema3a™" normalized differences in neuronal length and complexity 
conferred by ventral AS (Fig. 5j-m). Together, these findings show that 
AS encode subtype-specific sensory axon guidance signals. 


DISCUSSION 


We propose that region-restricted AS comprise a stable ‘scaffold’ that 
maintains positional information throughout embryonic and post- 
natal development. This positional code is necessary for proper cir- 
cuit formation, refinement and neuronal survival in a subtype-specific 
manner. Loss of Sema3a function from AS led to a sequence of «-MN- 
specific phenotypes, comprising defective postnatal maintenance of 
AIS orientation, markedly abnormal inhibitory and excitatory cur- 
rents in MN and abnormal synapse investment, and finally, a-MN 
loss. Concomitantly, in more dorsal regions, AS-encoded Sema3a acts 
to repel TrkA sensory afferent fibres in a subtype-specific manner. It is 
possible that these phenotypes represent a pathological progression, 
or alternatively, that ventral AS-encoded Sema3a has multiple coor- 
dinated functions that determine structural and functional sensori- 
motor circuit integrity. 

Our in vitro studies further suggest that AS positional identity is at 
least partly cell intrinsic, as cognate Sema3a-dependent regional AS 
properties were retained in co-cultures independent of local envir- 
onmental cues. As such, a testable prediction is that embryonic CNS 
patterning mechanisms might establish a template for generation of 
heterogeneous properties of AS”. Furthermore, although maintenance 
of MN axonal orientation represents one tropic effect of AS-encoded 
Sema3a, further investigation is needed to assess other potential roles 
of this or other AS-encoded regional cues, such as promoting dendrite 
growth”””*, maintenance of neuronal soma position, local synaptic 
strength and/or sensorimotor specificity”. 

Specialized local functions of AS in neural circuit formation may 
also have significance in human disease. For example, loss of ventral 
SC MN in amyotrophic lateral sclerosis (ALS) has been associated 
with mutant superoxide dismutase protein in ventral AS in animal 
models of the disease*’. Our findings suggest the possibility that the 
unique identity of ventral horn AS might lead to deficient local sup- 
port for MN and disease progression in ALS. More generally, given 
that AS are regionally patterned throughout the CNS’, the concept of 
regional AS function and dysfunction has implications for a variety of 
neurodevelopmental and psychiatric disorders. 


METHODS SUMMARY 


Animals were maintained in the University of California San Francisco animal 
facility. All protocols were IRB approved and in accordance with the Institutional 
Animal Care and Use Committee guidelines. Circular data and statistics were 
analysed using Oriana4 software (Kovach Computing Services). Astrocyte mono- 
layers from microdissected PO mouse spinal cord were cultured for 12-14 days 
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before re-plating at sub-confluent density for MN co-cultures and high density 
for DRG co-cultures. Experiments with recombinant Sema3a (Peprotech) were 
on Matrigel in serum-free, growth-factor-free media. Nrp1 blockade with 10 pg 
ml’ antibody (R&D AF566) was added 30 min before Sema3a. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


Mice. All mouse strains were maintained in the University of California San 
Francisco (UCSF) specific pathogen-free animal facility, and all animal protocols 
were approved by and in accordance with the guidelines established by the Ins- 
titutional Animal Care and Use Committee and Laboratory Animal Resource 
Center. Mouse strains not otherwise referenced include CAG-GFP (MGI:3849685) 
and ChAT-eGFP (MGI:3694555). For embryonic tissues, plug date was considered as 
E0.5. Embryo age was confirmed by morphology and crown-rump length mea- 
surements at collection. 

Astrocyte isolation by flow cytometry. Postnatal day 7 spinal cords were micro- 
dissected using an ‘open book’ preparation to separate dorsal and ventral halves. 
DRGs and meninges were removed then dissociated with papain (20U ml‘; 
Worthington) for 80 min at 33 °C as described previously’’. Aldh111-positive and 
-negative cells were sorted as previously described'* on a BD Facs Aria II and gated 
on forward/side scatter, live/dead by DAPI exclusion, and GFP, using GFP-negative 
and DAPI-negative controls to set gates for each experiment. In some cases GFP- 
positive populations were re-sorted using the same gates to >95% purity. 

RNA isolation. RNA was isolated using TRIZOL reagent (Invitrogen) with gly- 
cogen added as carrier, DNase-digested to remove genomic DNA contamination, 
and further purified using the RNAeasy Kit (Qiagen). For microarray analysis, 
RNA samples were amplified using the Nugen Pico WT Ovation Kit and hybri- 
dized to Affymetrix Mouse Gene 1.0 ST arrays. 

Bioinformatics. Microarray data were pre-processed in R using the Biocon- 
ductor suite of software packages. The ‘oligo’ package was used to background 
correct, normalize and summarize 1,102,500 probes on mouse gene 1.0 ST arrays 
via the robust multi-array analysis (RMA) algorithm. After nonspecific filtering 
was applied to remove low-intensity (<100 FU in at least 75% of the arrays), low- 
variance, and un-annotated probe sets, 7,799 probe sets remained. The Limma 
package was used to compare and assess differential expression between different 
groups of samples using the ‘treat’ algorithm. Dorsal astrocytes, ventral astrocytes, 
dorsal non-astrocytes, and ventral non-astrocytes were compared directly for the 
‘dorsal versus ventral’ analysis. To generate an ‘astrocyte versus non-astrocyte’ data 
set, dorsal and ventral astrocyte probe sets were assigned equal weight and com- 
pared against dorsal and ventral non-astrocyte probe sets. With fold-change >1.2, 
false discovery rate = 0.15, we identified 5,158 genes differentially expressed 
between astrocytes and non-astrocytes, and 38 genes (39 transcripts) differentially 
expressed between dorsal and ventral astrocytes. 

qPCR analysis. Complementary DNA was generated from purified RNA using 
Superscript III (Invitrogen) and random decamers. Primers were designed for 
amplicons of 75-150 bp using Primer 3 and are available upon request. qPCR was 
done ona Roche lightcycler 480 using Sybr Green Master Mix (Roche). Melt curves 
were analysed for each experiment to ensure primer specificity. In most cases both 
B-actin (Actb) and Gapdh were used as housekeeping genes for normalization with 
similar results, and Aldh111 was used as a housekeeping gene in vitro. 
Immunohistochemistry/in situ hybridization. Most images were collected 
using a Leica SP5 confocal microscope. Antibodies used included mouse NeuN 
(Millipore), rabbit Sema3a (EMD biosciences), goat ChAT (Millipore), rabbit 
TrkA (gift of L. Reichardt, UCSF), chick GFP (Aveslabs), mouse Parvalbumin 
(Sigma), chick neurofilament (Encor), rabbit ankyrin G (Santa Cruz), mouse Err3 
(PPMX), chick peripherin (Millipore), rabbit Map2 (Millipore) mouse Isl 1/2 
(DSHB), sheep Chx10 (Abcam) and rabbit calbindin (Swant). Rabbit FoxP1 
and guinea pig Scip were provided by J. Dasen. In most cases staining was done 
overnight at 4°C in 5% serum/0.4% Triton, following heat-mediated antigen 
retrieval for 2min at 95°C in 0.1M citrate buffer, pH6.0. For GFP labelling 
antigen retrieval was 10 min at 70°C. For TrkA staining, no retrieval was used, 
and slides were stained overnight at room temperature (20 °C) in 5% serum/1% 
Triton. In situ hybridization was performed using standard protocols. The probe 
for the full-length rat Sema3a transcript was provided by A. Kolodkin, the Wnt7a 
probe was from A. McMahon and Plp1 probe was from I. Griffiths. 

Polarity analysis in vivo. Measurements of motor neuron orientation relative to 
the ventral root were calculated by measuring the angle between the following: 
(1) the vector from the MN nucleus to the axon hillock, marked by AnkG staining 
of the proximal axon segment; and (2) the vector from the MN nucleus and the exit 
point of the ventral root from the grey matter. Angle measurements ranged from 
0° (axon pointing towards the ventral root) to 180° (axon pointing directly away 
from ventral root). Circular data was analysed using Oriana4 software (Kovach 
Computing Services), and statistical analyses performed using a Watson’s U? test. 
Synapse and soma diameter counts. Counts of synaptic puncta on MN soma 
were performed at cervical (C4—5) and lumbar (L3-4) levels. Counts in the main 
figures represent pooled unbiased data from all levels with no size cutoff; however, 
all data were analysed in histogram format and sorted by MN soma area to deter- 
mine whether results were likely to be biased by altered ratios of MN subtypes. For 
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some subgroup analyses (Extended Data Table. 1), putative a-MN and y-MN 
were identified by size on the basis of histogram analyses of soma area. 
Astrocyte cell culture. Dorsal and ventral spinal cords from PO-1 mice were 
isolated and dissociated as above. Cells were plated at a density of >1 X 10° per 
25 cm’ flask in DMEM-hi glucose with 10% FCS/10 1M hydrocortisone, 5 pg ml * 
N-acetylcysteine, 2 1g ml’ insulin and 20ngml~’ EGF. Six days after plating 
cells, flasks were shaken to remove oligodendrocyte contamination. At 8 days, 
AraC was added to kill rapidly proliferating cells. For DRG co-cultures, 10-12 days 
after initial plating cells were re-plated into assay containers that consisted of 8-well 
glass chamber slides (BD) coated with poly-p-lysine and recombinant human 
fibronectin (Biomedical Technologies) to promote astrocyte adhesion, plating 
30,000 cells per well. In most cases, cultures were established using Sema3a™" 
mice, and adenoviral Cre recombinase (Vector Biolabs) was added to some wells 
2-4h after re-plating. For MN co-cultures, astrocytes were re-plated at 2,000 cells 
per well onto a reduced GF Matrigel substrate (BD) diluted 1:25 in DMEM. 
Astrocyte monolayers were then cultured for 2-3 days before adding neurons. 
DRG isolation and co-culture. DRGs from E13.5-14.5 mouse embryos were iso- 
lated and dissociated for 45 min in 0.25% trypsin/EDTA (Invitrogen). Five-hundred 
cells per well were plated onto astrocyte monolayers in minimal neural growth media 
containing DMEM:F12, 10% FCS, N2 and B27 supplements (Invitrogen), and co- 
cultured for 48 h before fixation in 4% paraformaldehyde and immunolabelling. 
Motor neuron isolation and co-culture. Spinal cord neurons were isolated from 
embryonic rat spinal cords on the basis of previous protocols”. In brief, spinal 
cords were dissected from E15 rat embryos, dissociated in 0.25% trypsin (Gibco) 
for 15 min and triturated to form a single-cell suspension in L-15 plus 10% FBS 
media (Gibco). The suspension was immunopanned in a series of negative selec- 
tion plates against rat neural antigen 2 (Ran2) and galactocerebroside, and then 
motor neurons were positively selected for on a final p75NTR panning plate. 
Adherent cells were released from the plate with a brief application of 0.05% trypsin 
(Gibco) and re-suspended in growth media (DMEM, B27, N2, Pen-Strep (Gibco)) 
before culturing. At re-plating (onto Matrigel-plated sub-confluent astrocytes for 
co-cultures, or Matrigel-coated wells for recombinant Sema3a experiments), fresh 
media was added consisting of DMEM hi-glucose supplemented with N2 and B27 
supplements 5 pg ml~' N-acetylcysteine, 5 pg ml’ insulin and 5 (1M forskolin. 
Recombinant Sema3a addition and Nrp1 blockade in vitro. MN were plated at 
1,000 cells per well as above. 12 h after plating, they were assessed by light micro- 
scopy, at which point most could be seen to have budded polar processes. 24h 
after plating, recombinant human Sema3a was added (Peprotech) at indicated 
concentrations. For Nrp1 blockade, an Nrp1-blocking antibody validated for this 
purpose (R&D Systems, AF566) was added at a concentration of 10 jigml~' and 
incubated with MN for 30 min before Sema3a addition. Goat IgG was added 
separately to control for nonspecific effects (not shown). Cells were cultured for 
an additional 36h before fixation and immunolabelling. 

Culture image analysis. For DRG co-cultures, neurons were identified with NF-H 
(Encor), in some cases TrkA, and PV immunostaining was used to identify sensory 
neuron subtypes. MN were identified with peripherin and traced with an overlay of 
peripherin and Map2 (to label distal dendrites). Spatially distinct neurons were 
photographed and analysed using Neuron] plugin/Image J to trace total neurite 
length for each neuron. Sholl analyses were performed using the Sholl analysis 
plugin after thresholding each neuron using Image J (parameters: start 25 11M, step 
size 10 uM). More than ten neurons per condition per experiment were analysed. 
For axon overlap in vitro, only MN directly abutting an astrocyte were included in 
the analysis, and overlap of proximal axon (defined as the axon segment proximal 
to the first branch point) was quantified using ImageJ. For neurite length, statistical 
analyses were performed on means of each experiment, and in Sholl and axon 
overlap analyses, t-tests were performed on pooled data. 

Whole-cell patch clamp recordings. Acute fresh lumbar (L3-4) spinal cord slices 
were prepared from hGFAP-cre:Sema3a™":ChAT-GFP mice and Cre-negative 
controls from P12-14 using previously described protocols and solutions’’. In 
brief, transverse slices (350-j1m thick) were cut with a vibratome (Leica Micro- 
systems) in a chamber filled with ice-cold sucrose cutting solution followed by a 
brief (60s) incubation in polythethylene glycol (Mn = 1,900-2,200). The slices 
were then incubated in cutting solution at 35° C for 30 min followed by 30 min in 
artificial cerebrospinal fluid then equilibrated to room temperature. Whole-cell 
recordings were made using patch clamp amplifiers (Multiclamp 700B) under an 
infrared-differential interference contrast microscope. Data acquisition and ana- 
lysis were performed using digitizers (DigiData 1440A) and analysis software 
pClamp 10 (Molecular Devices). Signals were filtered at 6 kHz and sampled at 
20 kHz. sEPSCs were recorded from motor neurons at —75 mV in voltage-clamp 
mode where the chloride reversal potential was 0 mV. sIPSCs were recorded at 
—55 mV where chloride currents were positive deflections in voltage clamp. Glass 
pipettes with a resistance of 2.5-4 MQ were filled with a K-methanesulphonate 
internal solution. To ensure currents measured were sEPSCs and sIPSCs, control 
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recordings were performed using standard pharmacology (Extended Data Fig. 7). 
Series resistance (15-25 MQ) was monitored throughout the whole-cell recording 
and data were discarded if the change in series resistance was >20% during the 
course of the experiment. 

Dil labelling of sensory afferents. Spinal cords from E18.5-PO mice were fixed 
for at least 48 h in 4% paraformaldehyde, then dissected from spinal column with 
DRGs intact. The ventral root was severed and a crystal of Dil (Molecular probes) 
was applied to the DRG with a needle. Cords were incubated for 12 days at 37 °C, 
then 300-j1m thick sections were cut by vibratome, mounted in PBS, and imaged 
by collecting confocal z-stacks at 2 |1M spacing. 

Sensory neuron termination analysis. Confocal sections of cervical spinal cords 
were overlaid with a uniform grid using Adobe Illustrator and terminations per 


image were labelled. When more than one break in a process was visible, only the 
distal-most termination was labelled. Data from all sections and animals were 
overlaid on a normalized grid and the number of terminations per quadrant 
calculated. 

Statistical analysis. Consultation obtained from the UCSF department of 
Biostatistics. Coefficient of variance (standard error as a percentage of the mean) 
was used to calculate minimum sample sizes. In most cases, sample sizes were well 
in excess. Student’s t-tests were two-tailed and based on Gaussian distributions. 
In all cases, replicates refer to biological rather than technical replicates. Blinding 
during analysis was used whenever possible for all in vivo studies. Most statistics 
analysed using Graphpad Prism software. Circular data in Fig. 2 were analysed 
using Oriana4 software (Kovach Computing Services). 
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Extended Data Figure 1 | Flow cytometry gating strategy and microarray. _ between dorsal and ventral (not shown). b, Summary of differentially expressed 
a, Schematic indicating microdissection of Aldh111-GFP-positive P7 spinal genes in astrocytes (AS), whole cord, or both using the analysis parameters 
cord and isolation by flow cytometry using scatter gates, doublet exclusion (not —_ indicated. c, Heatmap of all 39 genes differentially expressed between dorsal 
shown) and sorting for GFP-positive cells with live/dead exclusion by DAPI and ventral cord, highlighting astrocyte-enriched genes with known roles in 
staining. Percentage of Aldh1l11-GFP cells was not significantly different neural circuit development (red) or extracellular matrix (blue). 
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Extended Data Figure 2 | Coordinate expression of Sema3a and Nrp1 in 
astrocytes and neurons. a-c, Sema3a mRNA is expressed in radial glia (RG) 
and in protoplasmic cells that are NeuN negative throughout the embryonic 
and early postnatal period. Sema3a was not detected in DRG or in SC white 
matter (b). d, Sema3a is segregated from Plp-positive oligodendrocytes. e, MN 
Sema3a expression is detected in a-MN but not y-MN in cervical SC. f, g, High 
levels of Nrp1 expression in TrkA® fibres and cell bodies (white arrowhead) 
and in MN, but not in PV-positive fibres and cell bodies (yellow arrows). 

h, Quantification of percentage of Nrp1~ neurons per condition. 
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Extended Data Figure 3 | Fate map of conditional astrocyte deletion lines maps to astrocytes but not to neurons in P10 SC, including «-MN (purple), 
used in this study. a, hGFAPcre fate map labels fibrous and a subset of y-MN (blue) and interneurons (red). 
protoplasmic AS but not MN or interneurons in P10 SC. b, AldhIlicre fate 
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Extended Data Figure 4 | Motor neuron AIS orientation defects in cervical _ representative MN with identifiable AIS and a schematic of their location with 
spinal cord. a, Representative images of cervical SC confocal sections stainedto _ respect to the ventral root. c, Overlay of all cervical a~MN angles measured to 
distinguish «— and y-MN and identify their proximal axon segment generate data summarized in Fig. 2c, with positional information preserved, 
(asterisk denotes ventral root). b, Inset shows high-magnification view of demonstrates that misoriented AIS can be seen at all DV positions. 
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Extended Data Figure 5 | No evidence of abnormal MN cell body using Scip (a PMC and LMC marker. c, d, No obvious differences in DV or 
positioning with loss of astrocyte-encoded Sema3a. a, Representative mediolateral boundaries of ChAT * MN at comparable cervical or lumbar levels 
FoxP1 Islet1/2 co-labelling at three rostrocaudal levels in control and mutant at PO (using AldhIllcre to delete Sema3a) and P7 (with hGFAPcre), both time 
animals shows no differences between control and mutant. b, Similar stainings _ periods where misorientation of AIS is clearly evident. 
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Extended Data Figure 6 | Quantification of ventral interneuron quantification demonstrates a significant increase at this age. Data are 


populations after loss of astrocyte-encoded Sema3a. a, Chx10 stainingatE18 mean +s.e.m., student’s t-test. Data in a from n = 2 per group, 4 sections per 
and quantification. b, Calbindin staining of Renshaw interneurons at P30 and —_ animal; data in b from 4 per group, 4 sections per animal. 
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Extended Data Figure 7 | Additional data and controls for MN (2R)-amino-5-phosphonovaleric acid (AP5) block postsynaptic currents 
electrophysiology. a, 2 .M strychnine and 20 uM bicuculline block (at —75 mV) in a ChAT-GFP* lumbar MN. c, No difference in input 
postsynaptic currents (at —55 mV) in a ChAT-GFP* lumbar MN. b, 20 1M resistance, s[PSC amplitude or sEPSC amplitude between control (Cre-) and 
6,7-dinitroquinoxaline-2,3-dione (DNQX) and 50 uM hGFAPcre:Sema3a™ (fl/fl) MN. n = 5/each; mean + s.e.m., Student’s t-test. 
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Extended Data Figure 8 | Normal dorsal root ganglia in 
Aldh1LIcre:Sema3a™" mice. a, No difference in the number of 
subtype-specific neurons per DRG in control or AldhIl1cre:Sema3a™" mice 
(n = 3 from 4-5 sections per animal; mean + s.e.m.; Student’s t-test). 
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Extended Data Figure 9 | Differential expression of regionally 
heterogeneous astrocyte genes is partly preserved in vitro. qPCR 
quantification demonstrates that many regionally heterogeneous microarray 
genes prospectively identified in vivo remain differentially expressed in vitro 
after 17 days in culture, including ventral Sema3a. Mean + s.e.m., n = 3 
independent experiments. 


©2014 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


Extended Data Table 1 | Subgroup analyses of motor neuron data presented in Figs 2 and 3. 


Control (cre-) hGFAPcre: Sema3a"" p-value 
MN axon orientation, P7 ~— All MN 42433°(SD) 58442°(SD) = 
(° from ventral root) Cervical MN 42434°(SD) 62449°(SD) si 
Lumbar MN 42432°(SD) 55+42°(SD) * 
o-MN (NeuN+) 39433°(SD) 62+46°(SD) rn 
y-MN (NeuN-) 52+31°(SD) 49+29°(SD) ns 
MN soma size, P30 All MN 664414 483+10 ad 
(uM?) Cervical MN 599424 497416 ane 
Lumbar MN 687417 469414 = 
#vGlut1/MN All MN 4.840.26 2.9+0.20 ci 
Cervical MN 4.3+40.29 2.9+0.19 — 
Lumbar MN 5.140.33 3.340.33 ma 
o-MN (>500 pM) 5.6+0.25 4.1+40.24 ee 
y-MN (<500 pM) 2.27+40.28 1.95+0.19 ns 
#VGAT/MN All MN 2140.74 2740.96 ite 
Cervical MN 19.8841.1 24.541.5 ‘if 
Lumbar MN 21.8+1.0 27.841.4 bi 
o-MN (>500 pM) 25.741.1 37.541.3 = 
y-MN (<500 pM) 11.4+0.82 22.7+1.0 sia 
#vGlut2/MN All MN 18.29+0.93 16.11+0.88 ns 
Cervical MN 15.0+0.82 13.0+0.68 ns 
Lumbar MN 23.541.4 21.741.4 ns 
o-MN (>500 pM) 22.9841.0 21.2141.2 ns 
y-MN (<500 pM) 12.18+0.81 11.7+0.57 ns 
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Synapse elimination and learning rules 
co-regulated by MHC class I H2-D” 


Hanmi Lee', Barbara K. Brott’, Lowry A. Kirkby’, Jaimie D. Adelson’, Sarah Cheng’, Marla B. Feller?, Akash Datwani!+ 


& Carla J. Shatz! 


The formation of precise connections between retina and lateral geniculate nucleus (LGN) involves the activity-dependent 
elimination of some synapses, with strengthening and retention of others. Here we show that the major histocompatibility 
complex (MHC) class I molecule H2-D? is necessary and sufficient for synapse elimination in the retinogeniculate system. 
In mice lacking both H2-K” and H2-D? (K°D°- ‘~), despite intact retinal activity and basal synaptic transmission, the 
developmentally regulated decrease in functional convergence of retinal ganglion cell synaptic inputs to LGN neurons fails 
and eye-specific layers do not form. Neuronal expression of just H2-D? in K°D”~'~ mice rescues both synapse elimination 
and eye-specific segregation despite a compromised immune system. When patterns of stimulation mimicking endogenous 
retinal waves are used to probe synaptic learning rules at retinogeniculate synapses, long-term potentiation (LTP) is 
intact but long-term depression (LTD) is impaired in K?D”~'~ mice. This change is due to an increase in Ca** -permeable 
AMPA (a-amino-3-hydroxy-5-methyl-4-isoxazole propionic acid) receptors. Restoring H2-D° to K°D’~'~ neurons 
renders AMPA receptors Ca?* impermeable and rescues LTD. These observations reveal an MHC-class-I-mediated 
link between developmental synapse pruning and balanced synaptic learning rules enabling both LTD and LTP, and 


demonstrate a direct requirement for H2-D” in functional and structural synapse pruning in CNS neurons. 


Early in development, before photoreceptors function, retinal ganglion 
cells (RGCs) spontaneously generate correlated bursts of action poten- 
tials called ‘retinal waves’. Postsynaptic LGN neurons in turn are dri- 
ven to fire in similar patterns*’, and this endogenous activity is even 
relayed further into the visual cortex’. Although there is consensus that 
retinal waves and correlated activity are needed for RGC synapse remod- 
elling and segregation of RGC axons into eye-specific layers’®, little is 
known at the synaptic or molecular level about how natural patterns 
of activity are read out to drive elimination and structural remodelling 
before sensory experience. It is assumed that synaptic learning rules are 
present at retinogeniculate synapses, and that implementation of these 
rules ultimately leads either to synapse stabilization or elimination. Efforts 
to discover molecular mechanisms of developmental synapse elimina- 
tion have implicated several unexpected candidates, all with links to 
the immune system, including neuronal pentraxins, Complement Clq 
and MHC class I family members”"’. However, it is not known whether 
any of these molecules regulate plasticity rules at developing synapses. 
Moreover, because germline knockout mice were examined in each of 
these examples, it is not known whether neuronal versus immune func- 
tion is required for synapse elimination in vivo. Here we examine these 
questions and also test whether genetically restoring H2-D” expression 
selectively to CNS neurons in vivo can rescue synapse elimination in 
mice that nevertheless lack an intact immune system. 


Defective synapse elimination in LGN of K°D°~‘~ mutant 
mice 

MHC class I genes H2-D? and H2-K”, members of a polymorphic family 
of over 50, are expressed in LGN neurons" and were discovered in an 
unbiased screen in vivo for genes regulated by retinal waves: blocking 
this endogenous neural activity not only prevents RGC axonal remod- 
elling’, but also downregulates expression of MHCI messenger RNA”. 
Previous studies have suggested that MHCI molecules regulate synapse 


number in cultured neurons’’ and are needed for anatomical segrega- 
tion of RGC axons into LGN layers in vivo’. To examine if H2-K® 
and H2-D? are involved in functional synapse elimination, whole-cell 
microelectrode recordings were made from individual neurons in wild- 
type or K’D’ '~ LGN slices (Fig. 1a)'*!®. Adult mouse LGN neurons 
normally receive strong monosynaptic inputs from 1-3 RGC axons, 
but in development, many weak synaptic inputs are present. Most are 
eliminated between postnatal day 5 (P5) and P12 before eye opening, 
while the few remaining inputs strengthen, resulting in adult-like syn- 
aptic innervation by P24—P30 (ref. 15). By gradually increasing optic 
tract stimulation intensity, individual RGC axons with progressively 
higher firing thresholds can be recruited"*, generating a stepwise series of 
excitatory postsynaptic currents (EPSCs) recorded in each LGN neuron. 
For example, at P21 in wild type, only two steps are present (Fig. 1b), 
indicating that just two RGC axons provide input to this LGN neuron, 
as expected. In contrast, in K’p’-'— LGN neurons, there are many 
EPSC steps (Fig. 1c), a pattern similar to that in much younger wild- 
type mice before synapse elimination’*"*. 

To obtain more quantitative information, minimal stimulation was 
used to estimate single fibre strength (SF-AMPA)”’ (Methods and Extended 
Data Fig. 1a, b). On average, the amplitude of SF-AMPA in Rp? — 
neurons is almost half that of wild type, and the cumulative probability 
distribution of EPSC amplitudes recorded from K’D® /~ LGN neurons 
is also consistent with the presence of smaller sized EPSCs (Fig. 1d; 
note onset latency of SF-AMPA is similar in both genotypes (Extended 
Data Fig. 1c)). In contrast, maximal synaptic input (Max-AMPA) is 
not different between wild type and K’D’ ‘~ (Extended Data Fig. 1d). 
Fibre fraction, an index of how much each input contributes to total 
synaptic response!® (Methods), is half as large in K’D’ /~ than wild 
type (Fig. le), consistent with the idea that the number of RGC syn- 
apses in K’D” /~ LGN neurons is greater than in wild type. An alterna- 
tive possibility—that differences can arise from altered probability of 


1Departments of Biology and Neurobiology and Bio-X, James H. Clark Center, 318 Campus Drive, Stanford, California 94305, USA. *Department of Molecular and Cell Biology & Helen Wills Neuroscience 
Institute, University of California, Berkeley, California 94720, USA. +Present address: Sage Bionetworks, 1100 Fairview Avenue N., Seattle, Washington 98109, USA. 
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Figure 1 | Failure of retinogeniculate synapse elimination despite intact 
retinal waves in K?D®~/~. a-e, Impaired synapse elimination in K’D?~/~ 
mice at P20-P24. a, Slice preparation used for whole-cell recording from dLGN 
neurons and stimulation of retinal ganglion cell (RGC) axons in the optic tract. 
The retinogeniculate projection is visualized by injecting CTB AF488 (green) 
into the contralateral eye. b, c, EPSC amplitude versus optic tract stimulus 
intensity. Insets: example traces. d, Cumulative probability histograms of single 
fibre synaptic strength (SF-AMPA). Inset: mean + s.e.m. for wild type (WT) 


release—is unlikely because paired-pulse ratio, an index of presynaptic 
release probability, is similar in wild type and K’D’ ‘~ at a variety of 
stimulus intervals (Extended Data Fig. le-h). Together, these experi- 
ments, which directly measure the functional status of synaptic innerva- 
tion, demonstrate that either or both H2-K? and H2-D? are required 
for retinogeniculate synapse elimination. 


Intact retinal wave activity in K°D?~'~ mice 


Many previous studies have shown that retinogeniculate synapse elim- 
ination and eye-specific segregation in LGN fail if retinal waves are 
blocked or perturbed”’*. Thus, waves could be absent or abnormal in 
K’D’ '~ mice. To examine this possibility, waves were recorded using 
a multi-electrode array to monitor action potential activity from many 
ganglion cells in KD’ ‘~ or wild-type retinas between P5-P12, the 
peak period of extensive RGC synapse remodelling requiring waves. 
The spatio-temporal pattern of waves in K’D’ ‘~ is indistinguishable 
from wild type (Fig. 1f and Extended Data Fig. 2a—e). Moreover, the 
correlation index between all RGC pairs, a measure of the distance over 
which cells fire together'*’’, is almost identical (Fig. 1g and Extended 
Data Fig. 2a). Retinal wave activity also transitioned normally from 
cholinergic-dependent stage II (P5-P8) to glutamatergic-dependent 
stage III (P10-P12) (Extended Data Fig. 2a-e)*. After eye opening, 
vision in K°D’~'~ mice is also normal". Thus, synapse elimination 
and eye-specific segregation fail to occur despite intact retinal activity 
patterns in R’p? mice, indicating that one or both of these MHCI 
proteins acts downstream of activity to drive synapse remodelling. 


Neuronal H2-D” rescues elimination and segregation 

H2-D? and H2-K” are also critical for immune function and CD8 T-cell 
development’. Both MHCI molecules are expressed in LGN during 
the period of retinogeniculate synaptic refinement, with H2-D” expressed 
at a higher level than H2-K? (refs 10, 14). To separate a contribution of 
the immune system, and to examine if neuronal expression is sufficient 
for synapse elimination, H2-D? expression was restored exclusively to 
neurons by crossing K’D’/~ mice to NSED’* mice in which H2-D> 
expression is regulated under the neuron-specific enolase (NSE) pro- 
moter”. ‘Rescued offspring littermates have H2-D” expression restored 
to CNS neurons while the rest of the body remains Rep? - (K° De! ~s 
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(n = 12/N=6); K°D’'~ (n= 23/N= 8), *P< 0.05. e, Fibre fraction for 
wild type (n = 12/N= 6); K°D’ /~ (n= 21/N = 8), **P <0.01, t-test for 

d, e. f, g, Intact retinal waves in K’p’~~ at P10-P12. f, Raster plots of single- 
unit spike trains recorded from 10 representative RGCs during retinal waves. 
g, Correlation indices versus inter-electrode distance for all cell pairs for wild 
type (N = 5) versus K’p’-/~ (N= 6). Data correspond to mean values of 
medians from individual data sets and error bars represent s.e.m. n = cells/ 
N= animals. 


NSED°*); ‘control’ littermates (K°D’/~;NSED’) lack H2-K® and 
H2-D? throughout the body (Extended Data Fig. 3a). Genomic rescue, 
as well as low but highly significant levels of H2-D® mRNA (P = 0.0001) 
and protein can be detected in RK? De~'~:NSED?* thalamus at P10 
(Extended Data Fig. 3b-e). In contrast, no H2-D? can be detected in 
spleen, gut or liver, with little if any expression in retina, hippocampus 
and cortex of K°p’~/ ~:NSED?* mice. 

In K’D” ‘;NSED’* LGN neurons, only 1-2 EPSC steps could be 
evoked in response to increasing optic tract stimulus intensity (Fig. 2a), 
similar to the mature wild-type innervation pattern (see Fig. 1b) but very 
different from littermate K°-D” /~;NSED®” controls (Fig. 2b). Minimal 
stimulation also revealed an increase in SF-AMPA strength (Fig. 2c and 
Extended Data Figs 1c and 4b). Max-AMPA is similar between these 
genotypes (Extended Data Fig. 4a); thus fibre fraction in K’D’‘~ ;NSED?* 
LGN neurons is 56%, versus 25% in K°D’ '~;NSED’~ neurons (Fig. 2d)— 
also markedly similar to wild type fibre fraction (see Fig. le). Thus, 
expression of H2-D? in neurons rescues RGC synapse elimination in 
LGN of K’D’/~ mice close to wild-type levels. 

The formation of the adult anatomical pattern of eye-specific segrega- 
tion in the LGN involves synapse elimination: initially intermixed ret- 
inal ganglion cell axons from the right and left eyes remodel, eventually 
restricting their terminal arborizations to the appropriate LGN layer”. 
To examine whether eye-specific segregation in the LGN is also res- 
cued, anatomical tract tracing methods'*”’ were used at P34, an age 
chosen because it is more than 3 weeks after segregation is normally 
complete as assessed anatomically. The retinogeniculate projections in 
LGN of K’D’'~;NSED?* mice appear almost indistinguishable from 
wild type, both in eye-specific pattern (Fig. 2e) and in per cent ipsi- 
contra overlap (Fig. 2f and Extended Data Fig. 5a, b). Segregation is 
impaired in control R’p?! ~:NSED?~ littermates (Fig. 2e), as expected 
from previous studies of KD’ ‘~ mice!*. These anatomical results 
support the electrophysiological studies above and strongly suggest 
that both RGC synapse elimination and eye-specific segregation require 
neuronal H2-D°. 


Impaired LTD with natural activity patterns 


Synapse elimination is thought to involve cellular processes leading to 
synaptic weakening such as LTD**”*; conversely LTP-like mechanisms 
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Figure 2 | H2-D° expression in neurons rescues synapse elimination and 
eye-specific segregation in K°D’~/~ LGN. a-d, Rescue of synapse 
elimination at P20-P24. a, b, EPSC amplitudes versus optic tract stimulus 
intensity. Insets: example traces. c, Cumulative probability histogram of 
SF-AMPA. Inset: mean + s.e.m. for control: D>~ (Kep?! ~:NSED”-, 

n= 19/N=5). Rescue: D°* (K’D’/";NSED’*, n = 17/N = 7), **P< 0.01. 
d, Fibre fraction is also rescued in K°D’~/~ ;NSED?* (n = 16/N = 7) compared 
to K’D’'";NSED’~ (n= 18/N = 5), *P <0.05, Mann-Whitney U-test for 

c, d. Horizontal grey bars delineate Fig. le data (mean + s.e.m.). e, f, Rescue of 
eye-specific segregation in K’D’ /~;NSED"* at P34. e, Top: coronal sections of 
dLGN showing pattern of retinogeniculate projections from the ipsilateral 
(green) and contralateral (red) eyes. Bottom: region of ipsi-contra pixel (white) 
overlap between the two channels at 60% intensity threshold (T60%). 

f, Percentage of dLGN area occupied by ipsi-contra overlap. mean + s.e.m. for 
K’p? /-;NSED’~ (N = 3) and K’D’~'" ;NSED’* (N = 4) (T60%) (*P < 0.05, 
two way ANOVA) (see Extended Data Fig. 5). Horizontal grey bar indicates 
wild-type value at T60% (from ref. 14). n = cells/N = animals. 


are postulated for synaptic strengthening and stabilization”®”’. In addi- 
tion, spike-timing-dependent mechanisms are crucial in Xenopus tec- 
tum for visually driven tuning of receptive fields**. In mammalian LGN, 
LTP” or LTD” can be induced at retinogeniculate synapses using 100 Hz 
optic tract stimulation, which is far from the endogenous bursting pat- 
terns generated by retinal waves (Fig. 1f, g)'*. However, realistic patterns 
of optic tract stimulation mimicking waves, paired with postsynaptic 
depolarization of LGN neurons, have also been used; results revealed a 
synaptic learning rule that generates LTP when pre- and postsynaptic 
activity are coincident*"**, but LTD when presynaptic optic tract activ- 
ity precedes postsynaptic LGN depolarization within a broad window 
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corresponding to the 60-90 s duty cycle of retinal waves (Fig. 3a—c and 
Extended Data Fig. 2b)’!. Moreover, using these timing patterns in 
conjunction with optogenetic stimulation of retina is sufficient either 
to drive or prevent segregation of RGC axons depending on the pattern”’. 
To determine whether synaptic learning rules based on natural activity 
patterns are altered at K’D” ‘~ retinogeniculate synapses, perforated patch 
recordings were made in LGN slices from wild type versus K’D’ ‘~ at 
P8-P13, the relevant period when extensive synapse elimination and 
eye-specific segregation are actually occurring. First, paired-pulse stimu- 
lation was used to examine release probability: the same amount of syn- 
aptic depression was observed in wild-type and K’D'~ mice, indicating 
similar probabilities (Extended Data Fig. 6). Next, synchronous activity 
patterns were used, in which 10 Hz optic tract stimulation was paired 
with LGN depolarization (Fig. 3a, b:0 ms latency), generating 10-20 Hz 
bursts of action potentials in LGN neurons mimicking retinal waves*”. 
In wild type, synchronous stimulation induced LTP (Fig. 3d, f| 117 + 8% 
over baseline; P < 0.001). In K’p’-'- LGN neurons, the same protocol 
elicited LTP indistinguishable from wild type (Fig. 3e, f). 

In contrast, induction using asynchronous activity patterns reveals 
a defect in LTD. In wild type, when optic tract stimulation precedes 
LGN neuron depolarization by 1.1 s (Fig. 3a, c: 1,100 ms latency), LTD 
results (Fig. 3g, i: 12% decrease from baseline; P < 0.001). In contrast, 
in K°D’ ‘~, the same induction protocol failed to induce synaptic 
depression; if anything, a slight but significant potentiation was seen 
(Fig. 3h, i: 5% increase from baseline; P< 0.005). Thus, whereas LTD 
using asynchronous pre- and postsynaptic activity patterns is a robust 
feature of wild-type retinogeniculate synapses during the period of 
synapse elimination and eye-specific layer formation, it seems to be 
absent in K’D’ ~~ mice. This impairment is consistent with the fail- 
ure of synapse elimination and axonal remodelling in K’D’ ‘~ mice. 


Ca?* -permeable AMPA receptors at K°D?~‘~ synapses 
Impaired LTD in K’D’ ‘~ mice could be due to altered regulation of 
NMDA (N-methyl-p-aspartate)-receptor-mediated synaptic responses, 
as LTP and LTD are known to be dependent on NMDA receptors at a 
variety of synapses**. Surprisingly, the NUDA/AMPA ratio was not 
different between genotypes (Extended Data Fig. 7a, b). However, the 
kinetics of I, pa recorded in K’p?~'~ LGN neurons are markedly pro- 
longed compared to wild type (Fig. 4a—d). The slowed decay in K’D”/~ 
EPSCs is unlikely to be due to different peak I,up4 amplitudes (P > 0.1; 
Fig. 4d), but could occur if there were greater Ca” influx through 
AMPA receptors. 

Ca’* -permeable AMPA (CP-AMPA) receptors are blocked selectively 
by bath-applying the specific antagonist NASPM, a synthetic homo- 
logue of joro spider toxin*‘, Indeed in K’D’~/~ LGN neurons, 100 uM 
NASPM blocked 40% of the current recorded at —70 mV but only 20% 
in wild type (Fig. 4e and Extended Data Fig. 7c), confirming a two-fold 
increase in CP-AMPA receptor-mediated currents in K’p’~ neu- 
rons. Another diagnostic feature of CP-AMPA receptors is rectification 
in the current-voltage (I-V) relationship when spermine is present in 
the internal recording solution****. In wild type, the I-V relationship is 
linear. However in K’?D’~/~ LGN neurons, rectification is very prom- 
inent (Fig. 4f, g and Extended Data Fig. 7d) but can be linearized close 
to wild-type levels by bath application of NASPM (Fig. 4f, g), indicating 
that the prominent I-V rectification in K’D’ ‘~ arises from an increase 
in CP-AMPA receptors. 

Differences in composition of GluR subunits are known to modulate 
AMPA receptor Ca*" permeability, and tetramers containing GluR2 
confer Ca?* impermeability**. Indeed, the ratio of GluR1 to GluR2, the 
most prevalent subunits***””, is slightly increased in developing thal- 
amus from K°D?~/~ mice (30% increase in K’p?’ ~,P = 0.07; Extended 
Data Fig. 7e). The thalamus is highly heterogeneous, so we also examined 
cortical neuronal cultures: the ratio of GluR1 to GluR2 is also signifi- 
cantly increased (230% increase in RK? De; P = 0.03; Extended Data 
Fig. 7f). Elevated levels of GluR1 subunits suggest that AMPA receptors 
in K’D°’~'~ are more likely to be composed of GluR1 homomers, 
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Figure 3 | Impaired LTD but intact LTP at retinogeniculate synapses in 
ep ~ induced with natural activity patterns. a, Diagram illustrating basis 
for timing-dependent plasticity at developing retinogeniculate synapses. 
Spontaneous retinal waves propagate from “a, b’ towards ‘c, d’; neighbouring 
RGCs fire synchronously but asynchronously with respect to RGCs located 
elsewhere. Waves drive action potentials in postsynaptic LGN neurons with 
varying time delays between pre- and postsynaptic activity. Ages P8-13 were 
studied. b, c, Top: conditioning protocol for LTP (0 ms latency; b) or LTD 
(1,100 ms latency; c). Bottom: example membrane potential changes recorded 
in LGN neuron during conditioning protocol. OT, optic tract. d-f, Intact LTP 


yielding increased Ca** permeability. Together, results point to an increase 
in CP-AMPA receptors in K’D? ‘~ mice. Similar increases in CP-AMPA 
receptors at other synapses are known to shift synaptic learning rules 
away from LTD and towards LTP”. If so, the deficit in LTD observed 
with the asynchronous pairing protocol (Fig. 3c, i) in KD’ ‘~ LGN 
should be rescued using NASPM to block CP-AMPA receptors—just 
what is observed (Extended Data Fig. 8). 


Neuronal H2-D” rescues synaptic function and LTD 

IfH2-D” affects synapse elimination by regulating properties of AMPA 
receptors, then retinogeniculate EPSCs should be rescued to wild type 
in the LGN of K°D’-/~;NSED’* mice. Indeed, the kinetics of Iaypa 
are significantly faster in K’D’ /~ ;NSED’* LGN neurons compared to 
K’p?! ~:NSED?~ LGN neurons (Fig. 5a—c and Extended Data Fig. 9a), 
indicating a decrease in Ca~* -permeable AMPA receptors. Accordingly, 
NASPM-dependent inhibition of I,yypq is only 20% in K’D’ '~;NSED"*, 
significantly reduced from the 40% inhibition observed in littermate 


198 | NATURE | VOL 509 | 8 MAY 2014 


in K’D’~‘~. Single experiment showing LTP in wild type (d) and K’D’/~ 
(e). EPSC peak amplitude versus time. f, Summary of all 0 ms latency 
experiments: EPSC peak amplitude (% change from baseline) versus time 

(n = 6/N = 6 for each; P > 0.1, t-test). g-i, Deficient LTD in K’D’~'~. Single 
experiment for wild type (g) and K’pD’‘~ (h). EPSC peak amplitude versus 
time. i, Summary of all 1,100 ms latency experiments: EPSC peak amplitude 
(% change from baseline) versus time (n = 7/N = 7 for each; P < 0.01, t-test). 
Grey bars indicate induction period. Insets: average EPSCs (30 traces) before 
(grey) and after (black) induction. f, i, 1 min data binning. Ra, access resistance 
(MQ). n = cells/N = animals. 


Rp :NSED?~ mice (Fig. 5d and Extended Data Fig. 9a). Moreover, 
the I-V relationship is linearized in RK? 'D°~'~;NSED?* LGN neurons 
when spermine is present in the internal recording solution, and bath 
application of NASPM has little additional effect (P > 0.5 at +40 mV), 
similar to wild type (Fig. 5e and Extended Data Fig. 9b). Because the 
Ca’* permeability of AMPA receptors is close to wild-type levels in 
R’p?! ~;NSED?* LGN, it is possible that LTD is also rescued. Indeed, 
the same asynchronous activity pattern that failed to induce LTD in 
K’D"~ (Fig. 3) induces robust LTD (15%; P < 0.001) in K°D’ /~;NSED’* 
neurons, similar to wild type (Fig. 5f, g). Together, these observations 
suggest that restoring expression of H2-D? in neurons is sufficient to 
rescue LTD at retinogeniculate synapses via decreasing the Ca** per- 
meability of AMPA receptors. 


Discussion 


A major finding of this study is that the link between activity-dependent 
synapse pruning during development, and regulation of LTD and CP-AMPA 
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Figure 4 | Increased Ca”*-permeable AMPA receptors at retinogeniculate 
synapses in K’p’— LGN. a-d, Prolonged decay kinetics of I,qpa in 
K’p’~ mice. a, Average Iaypa (5-10 EPSCs) for wild-type versus Rep 
LGN neurons. I,mpa half-width (ms) (b), Iampa decay time (ms) (c) and 
peak amplitude (nA) (d) for wild type versus K’p’— (WT, n= 16/N=4; 
R’pD’, n=22/N= 5). e, Increased per cent inhibition of peak Iampa by 
NASPM (100 4M) in K?D?~/~ (n = 13/N = 4) versus wild type (n = 9/N = 3) 
(**P < 0.01; NS, not significant; Mann-Whitney U-test for b, c). f, Iampa I-V 
curves (normalized to —40 mV). g, Rectification index for wild type 

(n = 14/N = 3), K°D®/~ (n= 9/N = 3), KD” /~ +20 1M NASPM 

(n= 16/N = 4) or K°D”/~ +100 1M NASPM (n = 6/N = 2) (***P < 0.001 
for wild type versus K’D’ /~; P > 0.05 for wild type versus K’D’ /~ +NASPM 
(20 or 100 1M), Mann-Whitney U-test). Ages studied: P8-P13. Also, see 
Extended Data Fig. 7. n = cells/N = animals. 


receptors, requires neuronal MHCI function. It is notable that synapse 
elimination fails despite the fact that retinal waves and retinogeniculate 
basal synaptic transmission are intact. The persistence of multiple innerva- 
tion in K°D® /~ LGN neurons is highly reminiscent of the immature 
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Figure 5 | Neuronal expression of H2-D° restores Ca”* -impermeable 
AMPA receptors and rescues LTD. a-c, Iampa half-width (ms) (a), Iampa 
decay time (ms) (b), and peak amplitude (nA) (c) for KD?!" ;NSED’— 

(n = 9/N = 2) and K’p?'NSED"’* (n = 11/N= 4). d, Reduced per cent 
inhibition of peak I,qmpa by NASPM (100 1M) in K°D’~/";NSED?* 

(n = 10/N = 3) compared to K’D’/~;NSED’~ (n = 8/N = 2); *P <0.05, 
**P < 0.01, ***P < 0.001, Mann-Whitney U-test for a-d. e, Rescue of 
Tampa linear I-V relationship in K’D’ /~;NSED"* LGN. Rectification index 
at +40 mV for K°D? /~ ;NSED®~ (n = 11/N = 3) and K°D® /~ ;NSED?* 


ARTICLE 


synaptic connectivity in LGN of younger wild-type mice”, as well as of 
abnormal connectivity observed in LGNs of dark-reared or tetrodotoxin- 
treated wild-type mice'’. Together, these considerations indicate that 
H2-D? and H2-K° act downstream of neural activity. In studying syn- 
aptic plasticity at retinogeniculate synapses, we imposed plasticity induc- 
tion protocols that mimic natural patterns of spiking activity present in 
the retinogeniculate system during synapse elimination and eye-specific 
segregation. Our observation that in K°D®'~ mice LTD is impaired 
while LTP is intact can explain the failure in retinogeniculate synapse 
elimination: if synapses cannot undergo weakening, then they cannot 
be eliminated. Because immunostaining for MHCI proteins H2-D> 
and H2-K° is co-localized with synaptic markers in array tomography" 
and at synapses in immuno-electron microscopy'*”', these observa- 
tions also argue strongly that H2-D° and/or H2-K” at synapses regulate 
mechanisms of LTD, which in turn are required for synapse elimina- 
tion. It would be useful to know whether other molecules implicated in 
RGC synapse elimination such as Clq (ref. 9), which co-localizes with 
H2-D? and H2-K? at synapses”, also alter LTD or instead act down- 
stream of MHCTI to target already weakened synapses for removal. 

The rescue experiments performed here indicate that a single MHCI 
molecule—H2-D?, when expressed in neurons—is sufficient for func- 
tional synapse elimination and anatomical eye-specific segregation in 
the LGN. By crossing K’D’ /~ mice to NSED” transgenic mice, expres- 
sion of H2-D? alone was restored to neurons but not elsewhere in the 
body, rescuing LTD, functional synapse elimination, Ca?* -impermeable 
AMPA receptors and structural remodelling at retinogeniculate synapses. 
Notably, these brain phenotypes are rescued even though the immune 
system is still impaired in KD’ /~ ;NSED’* mice. Until this experi- 
ment, it was not known if any one MHCI molecule is sufficient either 
in vitro or in vivo, nor has it been possible to separate the general effects 
of immune compromise from the absence of H2-D? and/or H2-K? in 
neurons. Together, our observations argue for a key role for H2-D? in 
reading out endogenous activity patterns into a lasting structural frame- 
work. In the human genome, as in mice, the MHCI (HLA) locus is large 
and highly polymorphic. Recent genome-wide association studies have 
consistently linked specific single nucleotide polymorphisms in MHCI 
to schizophrenia’. Our observations offer possible mechanistic insight: 
alterations in expression levels of specific MHCI molecules at neuronal 
synapses could trigger changes in activity-dependent plasticity and syn- 
aptic pruning during critical periods of human development, generat- 
ing lasting alterations in circuits and behaviour. 
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(n = 13/N = 5) shows significant difference (**P < 0.005); K’D’~/~;NSED?* 
(+NASPM) (n = 7/N = 3) is not significantly different to K’D’-!-;NSED?* 
(P > 0.05), Mann-Whitney U-test. See also Extended Data Fig. 9. 

mean + s.e.m. f, g, LTD rescued in K’D’~'-;NSED°* LGN neurons. 

f, Ensemble average of all experiments at P8-9 (see Fig. 3). Grey bar indicates 
LTD induction period. 1 min data binning. g, Average per cent change 
(mean + s.e.m.) for wild type (N= 7), K°D” /~ (N= 7), K°D’~'~;NSED?— 
(N=7) and K’D’ ';NSED’*: (N = 9). *P < 0.05, **P < 0.01, NS, not 
significant, t-test. P8-13 ages studied. n = cells/N = animals. 
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METHODS SUMMARY 


All experimental protocols were approved by Stanford University Animal Care 
and Use Committees. K°D’~/~ mice were provided by H. Ploegh” and NSED?* 
mice by M. B. A. Oldstone”. These mice were maintained on C57BL/6 backgrounds. 
Crosses of these two lines generated K’pe ;NSED?* mice plus littermate controls. 
Electrophysiological recordings were made from LGN neurons by cutting para- 
sagittal brain slices containing dorsal lateral geniculate nucleus (dLGN) and optic 
tract; synaptic transmission and degree of innervation was assessed as previously 
described’. For plasticity experiments at retinogeniculate synapses, perforated 
patch-clamp technique and induction protocols with natural activity patterns were 
used*’. Multi-electrode array recordings of retinal waves and anatomical labelling 
of retinogeniculate projections to determine status of eye-specific segregation were 
carried out according to ref. 18 and ref. 14. Pharmacological investigation of CP- 
AMPA receptors was carried out according to ref. 34. All experiments were con- 
ducted and analysed blind to genotype except in Fig. 4 (genotype was obvious to 
experimenter because of phenotype). Sample sizes were chosen for each experi- 
ment to reach statistical significance (P value equal to or less than 0.05; details given 
in Methods and figure legends). 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


All experimental protocols were approved by Stanford University Animal Care 
and Use Committees. K’D’-/- mice were provided by H. Ploegh’? and NSED” 
mice by M. B. A. Oldstone”. These mice were maintained on C57BL/6 backgrounds. 
Crosses of these two lines generated Kp?! ~;NSED?* mice plus littermate controls. 
Electrophysiological recordings were made from LGN neurons by cutting para- 
sagittal brain slices containing dorsal lateral geniculate nucleus (LGN) and optic 
tract; synaptic transmission and degree of innervation was assessed as previously 
described'*. For plasticity experiments at retinogeniculate synapses, perforated 
patch-clamp technique and induction protocols with natural activity patterns were 
used*'. Multi-electrode array recordings of retinal waves and anatomical labelling 
of retinogeniculate projections to determine status of eye-specific segregation were 
carried out according to ref. 18 and ref. 14. Pharmacological investigation of CP- 
AMPA receptors was carried out according to ref. 34. All experiments were con- 
ducted and analysed blind to genotype except in Fig. 4 (genotype was obvious to 
experimenter because of phenotype). Sample sizes were chosen for each experiment 
to reach statistical significance (P value equal to or less than 0.05; details given in 
Methods and figure legends). 

Genotyping. For K’D?‘-: three primer sets were mixed together (A: 5’-CGG 
AAGTCGCCTTACCTGATTG-3’, B: 5’-CAGCAGAAACATACAAGCTGTC-3’, 
C: 5'-GTCTCCTCTGGCACCTATGGG-3’), from which bands of 520 bp were 
detected for wild type, 400 bp for K°D’~/~ and double bands of 520 bp + 400 bp 
for heterozygote H2-K°’*!"H2-D°*'—_ For NSED’ mice and K’°D’/~ :NSED"*: sev- 
eral primer sets were used, resulting in reliable and consistent results: a primer set 
consisting of NSED’-A (5'-GAGATCGACTCTAGAGGATC-3’) and NSED?-B1 
(5'-GCGCTCTGGTTGTAGTAGCC-3')” or NSED?-B2 (5'-GTAGCCGAGCA 
GGTTCCTCA-3’) amplifies part of NSE promoter region and exon 2 of H2-D? 
cDNA (~500 bp) (Extended Data Fig. 3b). Another primer set was designed to 
detect a unique p> minigene expression cassette of ~800 bp (A:5'-CGACACAGG 
GACGCTGACG-3’, B: 5’- CAGCTCCAATGATGGCCATAG-3). Taqman qPCR 
primer mix (H2-D1/H2-K1 (Mm04208017_mH) was also used, with same results. 
Slice preparation. Parasagittal brain slices containing the dorsal lateral geniculate 
nucleus (dLGN) and optic tract were obtained as previously described’*. The brain 
was removed rapidly and immersed in oxygenated ice-cold choline-based slicing 
solution (composition in mM: 78.3 NaCl, 23 NaHCO, 33.8 choline chloride, 2.3 
KCl, 1.1 NaH2PO,, 6.4 MgCl,6H20, 0.45 CaCl,*2H20, 23 glucose, aerated with 
95% O2/5% CO2). The hemispheres were separated at an angle of 5-10° relative to 
midline. The larger hemisphere including optic tract was glued onto the cutting 
stage of a vibratome (Leica VT 1000S) at an angle of 15-20°, which puts optic tract 
and LGN in the same plane relative to the blade; 250-|1m-thick slices were cut. Note 
that only one slice per animal containing both intact optic tract and LGN can be 
obtained due to the small size of the LGN at relevant ages. Cortex was separated 
from thalamus to prevent excitatory recurrent disynaptic responses between dLGN 
and corticothalamic neurons. Slices were recovered at 31 °C for 30 min in choline 
chloride slicing solution, and then for 30 min in artificial cerebrospinal fluid (ACSF) 
(ACSF composition in mM: 125 NaCl, 26 NaHCOs, 2.3 KCl, 1.26 KH2POu,, 1.3 
MgSO,.°7H320, 2.5 CaCl,*2H20, 25 glucose, aerated with 95% O2/5% COz). 
Electrophysiology. All recordings were at room temperature in a chamber with 
constant ACSF flow, except for paired-pulse ratio measurements, which were 
made at 30-32 °C. Stimulating electrode(s) were placed along the optic tract at 
the ventral end of dLGN and/or lateral edge of dLGN aligned with the recording 
site. LGN neurons were distinguished by their large soma size and 3+ primary 
dendrites'*. For synaptic current measurements, glass electrodes were filled with 
Cs* -based internal recording solution (2-4 MQ, composition in mM: 105 CsCl, 
20 TEA-Cl, 2 MgCl, 1 EGTA, 10 HEPES, 3 Mg-ATP, 15 phosphocreatine, 1 Na-GTP, 
5 QX-314, pH 7.4, 280 mOsm). To block inhibitory synaptic responses, SR95531 
(20 uM; Tocris) or picrotoxin (100 1M; Sigma-Aldrich) was bath applied. AMPA 
receptors were blocked with DNQX (20 1M; Tocris), and NMDA receptors were 
blocked with DL-APV (100 UM; Tocris). For NMDA/AMPA ratio and spermine 
(0.1 mM, Sigma-Aldrich) dependent I,mpa measurements, D600 (methoxyvera- 
pamil hydrochloride, 0.1 mM; Tocris) was added in Cs* -based internal solution to 
block voltage-gated calcium channel activation. Spermine-containing internal 
solution was prepared fresh every 1-2h. NASPM (1-naphthyl acetyl spermine 
trihydrochloride, 20-100 1M; Tocris) was bath applied. Rectification index was 
calculated by dividing peak I,map at +40 mV with at —40 mV from I-V curves 
measured using spermine-containing internal solution”. 

For patch-clamp recordings: synaptic responses were recorded using an Axopatch 
200B or 700A amplifier (Axon Instruments/Molecular Devices), digitized using 
Digidata 1322A (Axon Instruments) and data acquisition was performed by Clampex 
9.2 (Axon Instrument). For whole-cell voltage clamp experiments, the series res- 
istance (Rs) was corrected by 60-80% after cancellation of capacitive components. 
For synaptic plasticity experiments in which EPSC size was less than 120 pA, 
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compensation was omitted. Calculated liquid junction potentials (Cs* internal: 
4mV; K* internal: 9 mV) were not compensated, except for spermine-dependent 
I-V relationship plots (Figs 4 and 5). Series resistance (Rs) was constantly mon- 
itored throughout experiments by injecting small hyperpolarizing currents. 

For synaptic plasticity experiments: perforated patch-clamp technique was per- 
formed using a K* -based internal solution containing amphotericin B (Fisher Scien- 
tific) (composition in mM: 115 K-methane Sulphonate, 20 KCl, 0.2 EGTA, 10 
HEPES, 4 Mg-ATP, 10 phosphocreatine, 0.3 Na-GTP, pH 7.3 with KOH). Note 
that perforated patch recordings are necessary to prevent dialysis of intracellular 
components required for plasticity?*!°. Amphotericin B stock solution (40- 
60 mg ml _' in DMSO) was prepared fresh every day, from which 100-200 pg ml! 
internal solution was prepared hourly and kept on ice. The tip of the patch elec- 
trode was filled with internal solution without amphotericin B before back-filling 
with amphotericin B containing internal solution. Following the formation of GQ 
seal, gradual drop of access resistance was monitored until stabilizing at 20-120 MQ. 
Trials with sudden changes in Rs (clearly discernible due to rupture of perforation 
or clogging of patch pipette by amphotericin B precipitation) were discarded. 
EPSC changes with stable Rs within +20 MO were used for analysis except when 
gradual changes in Rs were correlated with changes in EPSCs size. After obtaining 
a stable baseline for 10-20 min, the configuration was switched to current-clamp 
mode and plasticity induction protocols were applied by pairing optic tract stimu- 
lation with postsynaptic current injection for 1s to generate 10-20 Hz action 
potentials either synchronously or with 1,100 ms latency (Fig. 3a-c)°". V., (resting 
membrane potential measured at J = 0 configuration in perforated patch clamp 
mode) was not different between wild type (—58.5 + 1.2 mV: N= 16 cells from 16 
animals), K’D’ ‘~ (—59.6 + 1.3 mV:N = 10), K’D’ /~ ;NSED"* (—60.5 + 1.9 mV: 
N= 13) or K°D’~'";NSED®” (—62.2 + 2.5mV: N= 7) (P> 0.1, t-test). The par- 
ing protocol was repeated 10 times over a 40s interval. Data were acquired with 
resting membrane potentials (measured at I = 0) greater than —53 mV. For the 
NASPM LTD rescue experiments (Extended Data Fig. 8), 20 4M NASPM was 
added to the bath, and the asynchronous pairing protocol (10 times over a 40s 
interval) was repeated twice, separated by 20 min. N = one cell per each animal. 
Percentage change is mean + s.e.m. of EPSC amplitude averaged over 30 min after 
conditioning period as compared to baseline per cell. 

For optic tract stimulation, concentric bipolar (CBABD70; FHC Inc.) or low- 

resistance glass-electrode filled with ACSF were used for stimulating electrodes 
without noticeable differences. Stimulus strength was regulated using an ISO-Flex 
stimulus isolator (A.M.P.I.) and applied at 0.025-0.033 Hz. For plasticity experi- 
ments, stimulus strength was set to achieve stable EPSCs (10-120 pA) activated by 
a single or a few fibres, and stimuli were applied at 0.025 Hz. To measure spermine- 
dependent I-V rectification, stimulation intensity was adjusted to evoke Iaypa 
currents in the 100-300 pA range, due to the presence of Ca”*-permeable AMPA 
receptor-mediated synaptic responses in K’D’"~ mice. 
Evaluation of the number of synaptic inputs. The number of connected fibres 
was estimated using two approaches modified from ref. 15. First, synaptic res- 
ponses were recorded while varying optic tract stimulus intensity from minimal 
to maximal. Stimulation in increments of 1.0-2.5 1A was used between 0 1A and 
100 pA, and larger increments (10-20 A) were used at stimulation intensities 
greater than 100 1A. The stimulus-response profile provides an estimate of the 
number of RGC inputs to the cell. We also estimated the number of inputs to each 
LGN neuron based on the single-fibre AMPA current elicited at —70 mV by min- 
imal stimulation as a per cent of the total current elicited by maximal stimulation 
for the same cell (Fibre fraction). For minimal stimulation, the method is based on 
the assumption that minimal stimulation elicits synaptic responses from only a 
single fibre in the optic tract (SF-AMPA). After a synaptic response was observed, 
the intensity of electrical stimulation was lowered to obtain a failure rate higher 
than 50% (20-50 repeats). Synaptic responses with an onset latency of <5 ms 
within a 1-ms window were regarded as direct, monosynaptic responses. The mean 
peak amplitudes of successful responses were taken as the strength of the single 
fibre synaptic response (SF-AMAP). Onset latency of SF-AMPA by minimal 
stimulation (estimated as time to reach 10% of peak I,ypa) was not different 
between wild type, K’p’'-, K°p’'-:NSED?~ or K°D’~'~:NSED’* (Extended 
Data Fig. 1c). For maximal stimulation, electrical stimulation was increased until 
the peak synaptic currents reached steady-state level (1-600 1A range). Fibre frac- 
tion was calculated as a ratio of single-fibre synaptic strength (SF-AMPA) to maxi- 
mal synaptic strength (Max-AMPA) from the same cell (FF = SF/Max-AMPA per 
cell, %). The number of cells recorded per animal was limited to 4 for SF-AMPAs 
and Max-AMPA measurements. 

Kinetics of EPSCs were measured as both the width at half-maximal EPSC 
amplitude (half-width), and decay time within 80-20% range of peak EPSCs using 
Clampfit. 

Multi-electrode array recordings of retinal waves. Retinas were isolated from wild- 
type and K’p?'~ mice and pieces of retina were placed retinal-ganglion-cell-side 
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down onto a 60-electrode multi-electrode array arranged in an 8 X 8 grid (exclud- 
ing the four corners) with 10 jum diameter electrodes at 100 um spacing (Multi 
Channel Systems)'*“*. The retina was held in place with a weighted piece of dialysis 
membrane and superfused continuously with artificial cerebral spinal fluid (ACSF) 
containing (in mM): 119.0 NaCl, 26.2 NaHCOs, 11 glucose, 2.5 KCl, 1.0 K,HPO,, 
2.5 CaCl, and 1.3 MgCl. ACSF was oxygenated with 95% O, and 5% CO, and 
maintained at 32°C, pH7.4. Each preparation was allowed to equilibrate for 
20 min before data acquisition. Spontaneous firing patterns were then recorded 
for 60 min. Voltage traces on each electrode were sampled at 20 kHz and filtered 
between 120 and 2,000 Hz. Events that crossed a spike threshold were sorted offline 
to identify single units using Plexon Offline Sorter software. Single units were 
identified by combining principal component analysis together with a valley-seeking 
algorithm and were inspected manually. The mean firing rate of all units throughout 
the recording was calculated and units with a mean firing rate less than 10% of 
overall mean firing rate were excluded from further analysis. Spike-sorted data 
were analysed in MATLAB (Mathworks). To identify bursts, a modified Poisson 
Surprise algorithm was used as described previously. Mean burst values for each 
retina were grouped according to age (P5-P8 and P10-P12) and genotype (wild 
type and K’D’‘~), Differences between means of different genotypes within each 
age group were evaluated for statistical significance using a t-test. To determine 
spatial properties of waves, a pair-wise correlation index, CI, was computed as a 
function of distance between two cells for all spikes in the recording, using a 
method described previously***. The correlation index gives a measure of the 
likelihood relative to chance that a pair of neurons fire together within a 100-ms 
time window. The distance between cells was approximated as the distance between 
the electrodes on which the activity of the cells was recorded. The cell pairs were 
grouped according to their intercellular distance, and medians were computed 
over all cell pairs in a given distance group for each individual retina. The median 
correlation indices were then averaged across retinas, for each age and genotype, 
and plotted as a function of intercellular distance. 
Anterograde labelling of retinal ganglion axons and multiple threshold ana- 
lysis. All methods were performed according to previously described studies'*“”. 
P31-34 mice were anaesthetized with isoflurane. Cholera toxin B (CTB) subunit 
(1-2 pl) conjugated to AF488 was injected in the right eye and CTB subunit 
conjugated to AF594 was injected in the left eye (1 mgml ‘ dissolved in 0.2% 
DMSO, Invitrogen). After 24h, animals were perfused and fixed by transcardial 
perfusion of 0.1 M phosphate buffered saline (PBS), then ice-cold 4% paraformalde- 
hyde (PFA) in 0.1 M PBS. Brains were post-fixed in 4% PFA overnight. 100-,1m- 
thick sections were cut on a freezing microtome. Sections were mounted with 
Prolong Antifade Gold (Invitrogen), and after coverslipping were imaged on a 
Zeiss Leica PS2 confocal microscope (Leica Microsystems). dLGN sections taken at 
the middle of the rostral-caudal extent of dLGN where the area of the ipsilateral eye 
projection is greatest were imaged in red and green channels. Images were acquired 
and analysed such that the peak intensity values were below maximum, and 
multiple threshold analysis was used by varying red and green channels at intensity 
thresholds of 20%, 40%, 60%, 80% and 100% of maximum. To measure the amount 
of overlap between inputs from both eyes, pixels overlapping in both red and green 
channels were identified using Image J (NIH) and the Colocalization plug-in tool. 
The total area of overlapping pixels was represented as a percentage of total dLGN 
area. Two-way ANOVA was used for statistical analysis. 
RT-PCR. RNA was prepared from tissue using RNAqueous-4PCR (Ambion, 
Life Technologies) followed by cDNA synthesis using iScript Select cDNA syn- 
thesis kit (Bio-Rad). RT-PCR was performed using 0.5-1.0 ug of template cDNA 
using primers as follows (Extended Data Fig. 3b, c). H2-D°: A, 5’-CAAGAGC 
AGTGGTTCCGAGTGAG-3’; B, 5’-CTTGTAATGCTCTGCAGCACCACT-3’. 
GAPDH (glyceraldehyde-3-phosphate dehydrogenase; used as a reference gene): 
A, 5'-ATTGTCAGCAATGCATCCTGC-3’; B, 5'-AGACAACCTGGTCCTCA 
GTGT-3’, 

The quality of cDNAs was confirmed by genotyping PCR reactions using 0.5- 
1 1g of cDNA as template and the samples with genomic DNAs were discarded. 
H2-D?-specific RT-PCR bands from K’D’~/~ ;NSED** were confirmed by sequenc- 
ing following cloning into pCR2.1-TOPO TA vector (Life Technology Corporation). 
Taqman qPCR. RNA was extracted from each thalamus and cDNA was synthe- 
sized using iScript cDNA Synthesis kit (Bio-Rad). Gene expression was analysed 
with Taqman Gene Expression Assays (Applied Biosystems) for H2-D1/H2-K1 
(Mm04208017_mH) and housekeeping gene GAPDH (Mm99999915_g1). All 
reaction mixes contained 5% 20 Taqman Gene Expression Assay, 50% Taqman 
Master Mix (4304437), 25% MilliQ water, and 200 ng of cDNA in 20 ll total volume 
per well. Experiments were carried out on a 7300 Real Time PCR system (Applied 
Biosystems). The relative amount of tested message was normalized to GAPDH, 
and efficiency corrected based on standard curves. Gene expression levels were 
normalized relative to wild-type control, which was set to 1. All samples were run 


in triplicate at least three times. Analyses were carried out as previously described 
using one-way ANOVA®. 

Immunoprecipitation/western blots of H2-D”. Brain lysates in lysis buffer (150 
mM NaCl, 50 mM Tris, 0.25% sodium deoxycholate, 1% NP-40, 1mM EGTA, 
1mM PMSF, 1X Pefabloc (Roche)) were prepared by shearing ten times in a 
dounce homogenizer, then centrifuging at 12,000g for 10 min. Supernatants were 
precleared by incubation with Protein G-agarose beads (Invitrogen), and protein 
amounts were measured using a BioRad protein assay. Normalized lysates were 
then incubated with 12 pg anti-H2-D? antibody 28-14-8 (BD Biosciences, 553600) 
overnight at 4 °C. Protein G-agarose was incubated with samples for 45 min. Beads 
were washed three times with lysis buffer, and then heated to 85 °C for 3 min in 
NuPAGELDS 4X sample buffer (Invitrogen) + 1% B-2 mercaptoethanol. Samples 
were then electrophoresed on an SDS-PAGE gel, transferred to Immobilon-P 
PVDF transfer membrane (Millipore), and western blotted with rabbit monoclo- 
nal antibodies to H2-D®, made against the extracellular domain of H2-D». 
GluR1/2 expression levels. Thalami were isolated from P22 wild-type or K’D?~/~ 
mice, and synaptosome-enriched fractions prepared as described”. Freshly iso- 
lated individual thalami were homogenized in homogenization buffer (10 mM 
HEPES pH 7.3, 0.5mM EGTA, 33% sucrose, 4mM Pefabloc SC PLUS (Roche), 
and 0.2 mM phenylmethanesulphonyl fluoride (PMSF)), and centrifuged (10 min 
at 2,000g). Supernatants were passed through three layers of 100 um pore nylon 
membranes, and then through 5 um nitrocellulose filters. Filtrates were centri- 
fuged (10,000g for 10 min), and pellets re-suspended in homogenization buffer. 
Protein amounts were determined by BioRad Protein Assay (BioRad) (about 
200 [tg synaptosomal protein was obtained per thalamus), and SDS-PAGE sample 
buffer (Novex) was added to a 1X concentration. After heating (85 °C for 4 min), 
equivalent amounts of protein from each sample (20 1g per sample) were electro- 
phoresed on a SDS-PAGE gel, transferred to PVDF membrane (Bio-Rad), and 
western blotted with antibodies to GluR1 (Millipore, AB1504), GluR2 (Abcam, 
AB133477), N-tubulin (Abcam, AB18207), or GAPDH (Abcam, AB9485). 

For quantification, gels were scanned and intensities of bands were quantified 
using the Image J program. After quantification, intensity values for GluR1 and 
GluR2 were normalized by dividing them against each GAPDH or N-tubulin 
values from the same thalamus. Western blots typically were run with multiple 
wild-type and K’D’~‘~ samples; to compare results from different western blots, 
the intensity measurement for each sample thalamus was normalized against the 
average value derived from all wild-type bands on that same blot. The GluR1/ 
GluR2 ratio for each thalamus was then determined. Finally, an average GluR1/ 
GluR2 ratio was calculated from each individual thalami (Extended Data Fig. 7e). 

Mouse cortical cultures were made as described’. Wild-type or K°7D’ ~~ cor- 

tical neurons were isolated at E16.5, and cultured on 6-cm tissue culture plates for 
14 days in vitro. Plates were then washed twice with PBS and cells were lysed 
directly on the plates with lysis buffer (150 mM NaCl, 50 mM Tris-HCl pH7.5, 
0.25% sodium deoxycholate, 1% NP-40, 1 mM EGTA, 1 mM PMSF, 1X Pefabloc 
SC PLUS (Roche)) for 10 min. Lysates were then centrifuged at 12,000g for 10 min. 
Supernatants were normalized by BioRad Protein Assay (BioRad), and equivalent 
amounts of protein from each sample (25 ug per sample) were subjected to SDS- 
PAGE and western blotting with antibodies to GluR1, GluR2, GAPDH, and/or 
N-tubulin, as described above. Gels were scanned and bands quantified using 
Image J. Values were then normalized by dividing them by GAPDH or N-tubulin 
values, and an average GluR1/GluR2 ratio was calculated from each individual 
sample (Extended Data Fig. 7f). Non-parametric Mann-Whitney U-test (Prism 
software) was used to calculate P values for all analyses (two-tailed). 
Data analysis and statistics for electrophysiology experiments. Data analysis as 
well as graph plotting were done using Clampfit 10.2 (Axon Instruments) and the 
OriginPro8.0 (OriginLab Corporation). Cages containing mice of different gen- 
otypes used for each experiment were relabelled and randomized so that experi- 
menter did not know genotype. Experiments were then analysed blind to genotype 
unless stated in figure legends. In addition, Student’s t-test was used for data with 
normal distribution and Mann-Whitney U-test for data with non-normal distri- 
bution. Shapiro-Wilk test was conducted for normality test. N = number of ani- 
mals; n = number of cells. Average data are presented as mean + s.e.m., unless 
stated otherwise. For synaptic plasticity experiments, t-test was used. Two-tailed 
analysis was used throughout the study. 
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Extended Data Figure 1 | Comparison of retinogeniculate synaptic 
responses in wild type versus K’7D°~'~, a, b, Examples of minimal stimulation 
for wild type (open circles) and K’p?'~ (filled circles). Plot of EPSC peak 
versus number of stimulations (grey box represents failures, >50%). c, No 
difference in onset latency of SF-AMPA between all genotypes. Onset latency of 
SF-AMPA was estimated using minimal stimulation as time (ms) to reach 10% 
of peak I,mpa from stimulation artefact (wild type: 3.0 + 0.3 (n = 12 cells/ 
N=6animals); K°D?/~: 2.7 0.1 (n = 23/N = 8); K°D® /" ;NSED"*: 

3.0 + 0.2 (n = 17/N = 7); K°D” '";NSED?:2.6 + 0.2(n = 19/N= 5); P>0.5, 
t-test). d, Cumulative probability histogram shows no difference in Max- 
AMPA between wild type and K’D’‘~. Inset: mean + s.e.m. Wild type: 

2.6 + 0.4nA (n = 14/N = 6); K°D ~: 2.9 + 0.4nA (n = 22/N = 8); P>0.1, 
Mann-Whitney U-test. e-h, Presynaptic release probability at K’D’/~ 
retinogeniculate synapses is similar to wild type at P20-24. e, f, Examples of 
EPSCs evoked by paired-pulse stimulation of optic tract (20 Hz) recorded in 
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whole-cell mode in individual LGN neurons from wild type (e) versus K’D?/~ 
(f). g, h, Paired-pulse depression (PPD) (%) (EPSC 2/EPSC 1) over varying 
intervals. g, Wild type (open circle) versus K’p?-~ (filled circle) without 
cyclothiazide (CTZ), a blocker of AMPA receptor desensitization. Wild type 
versus K°D?~/~: 1 Hz: 82.3 + 2.6 (n = 10) versus 77.4 + 3.4 (n = 8); 10 Hz: 
44.9 + 2.3 (n = 9) versus 42.8 + 3.9 (n = 9); 20 Hz: 37. 1 + 2.1 (n = 10) versus 
37.3 + 2.8 (n = 9) (P > 0.1 for each). h, Wild type (open circle) versus Rep 
(filled circle) with CTZ (20 uM). Wild type versus K’p’/-: 1 Hz: 79.0 42.4 
(n = 9) versus 81.8 + 1.1 (n = 7); 10 Hz: 59.2 + 3.0 (n = 8) versus 57.4 + 3.7 
(n = 7); 20 Hz: 58.5 + 2.3 (n = 8) versus 56.6 + 4.1 (n = 7) (P>0.1 for each). 
N= 4 for wild type; N = 3 for K’p’ ~~ for g, h. There was no significant 
difference in PPD between wild type and K’D’ ~~, but note significant decrease 
of PPD +20 uM CTZ versus 0 uM CTZ application for both wild type and 
Kp’ at 10 Hz and 20 Hz (P< 0.05). t-test. mean + s.e.m. n = cells/ 

N= animals. 
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Extended Data Figure 2 | Intact spatio-temporal pattern of retinal waves in 
—'— mice at P5-12. a, Correlation indices as a function of inter-electrode 
distance for all cell pairs for wild type (grey) and K’D’ /~ (black) at P5-P8. 
Data points correspond to mean values of medians from individual data sets 
and error bars represent s.e.m. b-e, Summary of temporal firing patterns for 
retinas isolated from wild-type (open squares) and K’D’~/~ (filled squares) 
mice at P5-P8 (stage II, cholinergic waves) and P10-P12 (stage III, 
glutamatergic waves). Open circles correspond to the mean values of individual 
retinas. b, Interburst interval for stage II (in min): wild type: 1.9 + 0.2; Kp’: 


Burst duration 


P5-P8 


P5-P8 


P10-P12 


Mean firing rate 


© 


° 
° 
o th! oo 


Firing rate (Hz) 


8 dt 
00 GBH OO 


P10-P12 


P5-P8 


P10-P12 


1.6 + 0.2; stage III: wild type: 1.0 + 0.3; K’D®‘~: 0.9 * 0.1.¢, Firing rate during 
burst for stage II (in Hz): wild type: 8.6 + 1.9; K°D’'~: 8.4 + 1.7; stage III: wild 
type: 19.7 + 2.1; R’p’'-:18.6 + 2.2. d, Burst duration for stage II (in seconds): 
wild type: 4.2 + 0.2; K’D’ ‘~: 3.8 + 0.1; stage III: wild type: 2.6 + 0.6; K’D? /: 
2.1 + 0.2. e, Mean firing rate for stage II (in Hz): wild type: 0.5 + 0.1; K°D? : 
0.5 + 0.2; stage III: wild type: 1.0 + 0.1; K’D”/~: 0.8 + 0.1, mean + s.e.m. 

(P > 0.05 for each, t-test, N = 6 animals for each group, except when N = 5 for 
stage III wild type; non-blind experiments). 
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Extended Data Figure 3 | Rescue of H2-D° expression in brain of 
K’D?'-;NSED"* mice. a, Diagram of breeding strategy to generate 
K’p’';NSED"* mice. K°D?/~ (white indicates absence of H2-D°) were 
crossed to NSED? transgenic mice (black indicates presence of H2-D? in both 
body and brain). From F, offspring, H2-K°*!~ H2-D°*!-;NSED"* mice 

(grey body, black brain) were selected and crossed to K’D’~'~ mice further, 
generating K’D’ /~;NSED?* (black brain with white body indicates rescue of 
H2-D? expression in brain alone) and K’p’~'-;NSED*~ littermate controls 
(white body, white brain). b, H2-D?-specific primers. Solid arrows: forward 
(exon 2) and reverse (exon 3) for ~200 bp spliced mRNA as well as unspliced 
proRNA (~500 bp) (e; exon, i; intron). Dotted arrows: NSED?-A (forward, NSE 
promoter region) and NSED’-B1 and/or B2 (reverse, exon 2) for genotyping 
and mRNA detection”’. c, RT-PCR showing rescue at P10 in thalamus of 
K’D?'";NSED"* mice cDNAs. Wild-type thalamus (WT(T)) shown as 
positive control; K’p’"~ thalamus (K°D’ ~~ (T)) and spleen (K’pD’-‘~ (S)) as 


negative controls, and various organs from K’pD’~'-:NSED"* mice (spleen (S), 
liver (L), gut (G), thalamus (T), hippocampus (H), cortex (C), retina (R)) were 
used as templates. d, Quantitative PCR comparing relative H2-K°/H2-D? gene 
expression in wild type, K’pD’~"~ and K°D®~'~:NSED?* thalami. Left: results 
show small but highly significant rescue of H2-D® mRNA expression in 
K’p’'-;NSED?* (K°D?~'~: 0.00003 + 0.00002; K’D®~/~ ;NSED**: 

0.0035 + 0.00044 relative to wild type: 1.0056 + 0.032, ***P = 0.0001). Each 
point represents average relative gene expression for one mouse. Right: raw 
H2-K°/H2-D? Ct values for each genotype (wild type: 27.4 + 0.1; K’D’": 
44,3 + 0.4; K’D’ ~~ ;NSED"*: 35.9 + 0.3; ***P < 0.001), one-way ANOVA, 
mean + s.e.m. N= 5 animals for wild type, 7 for K’p’~'~, 10 for 
K’D®'-;NSED’*. e, Rescue of H2-D° protein in Kp’! ;NSED?* 

brains at P60. Western blot of immunoprecipitation from whole brain 

(three pooled brains) lysate from K’p® and K’D’ ':NSED"*. 
H2-D?-specific signal from K’D’/~;NSED"* appears below IgG band. 
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SF-AMPA and Max-AMPA recorded at retinogeniculate synapses according animals; for KD’ '";NSED’*: n = 16/N = 7 (P > 0.05, Mann-Whitney 
to H2-D° genotype. a, b, Max-AMPA (a) and SF-AMPA (b) showed U-test). Fibre fraction calculated from Max-AMPA and SF-AMPA 
similar cumulative probability histograms between wild type (black line), measurements is similar between wild-type and K’D’ /~;NSED"* mice 
K’pD’~' ;NSED"* (dashed grey line), K’D’~'~ (dashed black line) and (Fig. 2d). 


K’D?/~ ;NSED*~ (grey line). Number of experiments are the same as in the 
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Extended Data Figure 5 | Neuronal H2-D” expression in K°D?~'~ 

mice rescues impaired eye-specific axonal segregation at P34. a, Coronal 
sections of dLGN of K’D ~!~ .NSED?— (top row) and RK D’~!~;NSED?* 
(rescue; bottom row) showing pattern of RGC axonal projections from the 
two eyes after intraocular tracer injections of CTB AF594 (red channel; 
contralateral eye injected) or AF488 (green channel; ipsilateral eye injected). 
Left: thresholded fluorescent images of dLGN at 60% maximum signal intensity 
(see Fig. 2e). Right: overlap of RGC projections (white pixels) from ipsilateral 
and contralateral eyes displayed for 20%, 40% and 60% maximal threshold 
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Threshold (% maximum) 


for K’D’~'~ ;NSED?~ (top) and K’D?'~ ;NSED’* (bottom). Overlap = pixels 
labelled in both red and green channels. b, Mean percentage dLGN 

area + s.e.m. pixel overlap for K’D’ ‘~;NSED"~ (filled squares; N = 3) versus 
K’p?~'~;NSED°* (open triangles; N = 4): 0% threshold: 8.3 + 1.1 versus 

1.2 + 0.3; 20% threshold: 12.2 + 1.5 versus 2.9 + 0.7; 40% threshold: 17.9 + 1.8 
versus 5.3 = 0.9; 60% threshold: 21.1 + 1.8 versus 7.2 + 0.9; 80% threshold: 
23.2 + 1.3 versus 11.1 = 1.2; 100%: 25.3 = 1.9 versus 14.0 + 1.5 (P< 0.05, 
two-way ANOVA). See Methods and ref. 47. 
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Extended Data Figure 6 | Intact release probability at K’p?- 
retinogeniculate synapses before eye opening. Paired-pulse stimulation was 
delivered to the optic tract at 10 Hz, similar to the natural firing frequency of 
RGCs (Extended Data Fig. 2c), and whole-cell recordings were made from LGN 
neurons in slices aged between P8-13. Paired-pulse stimulation resulted in 
synaptic depression, represented as EPSC 2 divided by EPSC 1 (%). In 0 uM 
CTZ (left panel): wild type: 67.0 + 2.9 (n = 11/N = 4); K°D” '~: 64.2 £2.8 
(n = 7/N = 2). In20 uM CTZ (right panel): wild type: 48.6 + 4.9 (n = 8/N = 3); 
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Rep: 46.6 +2.5 (n = 7/N = 2) (P> 0.1 for each, t-test), mean + s.e.m. 
20 mM BAPTA containing Cs“ -internal solution was used for this experiment 
due to prolonged kinetics of EPSCs in K’D? ‘~. The identical paired-pulse 
ratios between wild type and K’D’/~ are consistent with the conclusion that 
presynaptic release probability is intact at P8-13 retinogeniculate synapses in 
K’D’~‘~ mice. (See also Extended Data Fig. le-h for similar conclusion at 
P20-24, after synapse elimination is largely complete.) 
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Extended Data Figure 7 | Normal NMDA/AMPA ratio but increased 
Ca”*-permeable AMPA receptors at retinogeniculate synapses in KD? '~ 
mice. a, b, NMDA/AMPA ratio is unchanged in K’pD’~~ mice. a, NMDA/ 
AMPA ratio (%): peak I,ypa measured at —70 mV (+20 1M SR95531) versus 
peak Inmpa at +40 mV (+20 1M SR95531 + 20 1M DNQX): wild type: 
61+6.8 (n= 10/N=4); K°D /~: 70.6 + 12.1 (n =7/N = 3) (P> 0.1, t-test) 
mean ~ s.e.m. b, Example recordings from individual neurons for wild type 
(left) and K’D’~'~ (right). APV (100 1M) was added at the end of each 
experiment to confirm NMDA-mediated synaptic currents. D600 in pipette. 
c, Example showing effect of NASPM (100 .M bath) on I, mpa: note significant 
blockade of Iqypa in K°7D?/~. Grey line, before NASPM; black line, after 


N-Tubulin 


GAPDH 


Internal: + spermine 
Bath: +SR95531 (20 uM) 
+ APV (100 uM) 


WT K°De* 


3.570 wt 


@K'D” * 


3.0 
2.5 
2.0 


1.5 


1.0 


GluR1/2 ratio 


0.5 


NASPM (5 traces averaged for single cell). S$R95531 (20 uM) added to bath for 
a-c. d, Examples for I4mpa normalized to EPSC amplitude at —40 mV. Note 
reduction in EPSC amplitude at +40 mV in K’D’ /~ but not wild type. 100 uM 
APV + 20M SR95531 in bath. Spermine (100 LM) and D600 (100 1M) in 
pipette. Ages: P8—-13. Experimenter was aware of genotype due to obvious 
differences in time course of EPSCs and effects of NASPM. e, Example 
western blot (left) and GluR1/GluR2 ratio (right) of P22 thalamus; wild type: 
1.0+0.1 (N= 12); KD’ ‘~: 1.3 + 0.1 (N= 13) (P = 0.07). f, Example western 
blot (left) and GluR1/GluR2 ratio (right) of cultured cortical neurons; wild type: 
1.0 + 0.1 (N= 4); K°D’~~; 2.3 + 0.7 (N= 4) (*P = 0.03). Mann-Whitney 
U-test for e, f, n = cells/N = animals. 
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Extended Data Figure 9 | Neuronal H2-D? expression decreases Ca”* 
permeability of AMPA receptors at retinogeniculate synapses in 

K’p’ '-;NSED?* mice. a, NASPM blockade of I ‘ampa is significantly reduced 
in K°D®-/";NSED®* mice. Example trace of NASPM effect on I,mpa recorded 
from K°D?~'";NSED?— (left) or K’p’!" ;NSED’* (right) individual LGN 
neuron. Grey line, before NASPM; black line, after NASPM application 

(5 traces averaged for single cell); 20 uM SR95531 in bath. b, Internal 


Internal: + spermine 


Bath: + SR95531 (20 uM) 
+ APV (100 uM) 


spermine-dependent block of I,yypa at positive membrane potentials is rescued 
in K’D®~'";NSED®* LGN neurons. Example recordings for I,yspq normalized 
to EPSC amplitude at —40 mV from individual neurons. Note reduction in 
EPSC amplitude at +40 mV in K’p’'-; NSED°~, but restored to wild-type 
level in K’D’~'~;NSED”*. 100 1M APV + 20 uM SR95531 in bath. Spermine 
(100 uM) and D600 (100 1M) in internal solution. 
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Gamma-ray bursts (GRBs) are most probably powered by collimated 
relativistic outflows (jets) from accreting black holes at cosmological 
distances. Bright afterglows are produced when the outflow collides 
with the ambient medium. Afterglow polarization directly probes 
the magnetic properties of the jet when measured minutes after the 
burst, and it probes the geometric properties of the jet and the ambi- 
ent medium when measured hours to days after the burst’. High 
values of optical polarization detected minutes after the burst of GRB 
120308A indicate the presence of large-scale ordered magnetic fields 
originating from the central engine’ (the power source of the GRB). 
Theoretical models predict low degrees of linear polarization and no 
circular polarization at late times® *, when the energy in the original 
ejecta is quickly transferred to the ambient medium and propagates 
farther into the medium asa blast wave. Here we report the detection 
of circularly polarized light in the afterglow of GRB 121024A, mea- 
sured 0.15 days after the burst. We show that the circular polariza- 
tion is intrinsic to the afterglow and unlikely to be produced by dust 
scattering or plasma propagation effects. A possible explanation is to 
invoke anisotropic (rather than the commonly assumed isotropic) 
electron pitch-angle distributions, and we suggest that new models 
are required to produce the complex microphysics of realistic shocks 
in relativistic jets’. 

Magnetic fields play a crucial role in the physics of relativistic jets— 
for example, in their formation, acceleration and collimation’'*’*. On 
smaller spatial scales, there is a strong connection between particle accel- 
eration and magnetic field generation in the collisionless relativistic shocks 
that create GRB afterglows’""*. Our understanding of magnetic field 
properties in GRBs and their afterglows has improved rapidly through 
recent observational successes, such as time-resolved linear polarimetry of 
prompt y-ray emission’*’* (just seconds after the burst) and the observed 
transition of reverse shock emission (high levels of optical linear polar- 
ization, seen just minutes after the burst) to the early forward shock after- 
glow emission (lower levels of linear polarization)°. These observations 
have given support to models predicting large-scale ordered fields in the 
GRB ejecta’. On the other hand, late-time polarimetry (hours to days after 
the burst) offers the advantage that at these times the optical emission 
probably originates from a single emission process, namely synchrotron 


emission from the forward shock (that is, emission from the shocked am- 
bient medium), in which ordered fields are much less likely to be present. 
This then allows measurements of the geometry of the jet through mon- 
itoring of the late-time polarization angle” *'®”, but crucially also offers 
a simple test for afterglow microphysics via circular polarimetry—detailed 
models have been developed, but observational attempts have largely 
focused on radio reverse shocks, producing only upper limits’, and a 
single optical non-detection”’. 

GRB 121024A was detected by the Burst Alert Telescope (BAT) on 
board the Swift satellite at 02:56:12 UT on 24 October 2012"; a redshift 
of z = 2.298 was determined shortly afterwards (Methods). We obtained 
imaging polarimetry observations with the FOcal Reducer and low dis- 
persion Spectrograph (FORS2) on the Very Large Telescope (VLT), using 
a Wollaston prism and quarter- and half-wavelength plates (Extended 
Data Figs 1 and 2). Observations with the ESO Rgpecia filter commenced 
at 2.57 h after the burst, when the afterglow had an R-band brightness 
of R ~ 19.8 mag. After two sets of linear polarimetry (giving the per- 
centage of linear polarization, P);,), we obtained four consecutive mea- 
surements of the percentage of circular polarization (P.i,), followed by 
a further 9 measurements of P\i, (of which 4 were on the second night). 
Data reduction and calibration follow standard procedures'’” (Methods). 
Simultaneously with the polarimetry, we monitored the afterglow optical 
light curve with the Gamma-Ray Burst Optical/Near-Infrared Detec- 
tor (GROND) instrument (Methods). In the following we will use a 
notation where the flux density F depends on frequency v and time fas 
Fx t *v *, with temporal decay index « and spectral energy index f. 

The X-ray light curve obtained by Swift (retrieved from the online 
Swift/XRT GRB light curve repository”®) is well described by three power- 
law segments, where the first break occurs at fpreak,1 = 619732 s and 
the second break at threak.2 = 3.4133 x 104 s (errors at 90% confidence 
level). In the GROND light curve, we find evidence of a break at a time 
consistent with this last X-ray break (Extended Data Fig. 3). A combined 
fit to the X-ray and GROND data, using a smoothly broken power law 
and a host galaxy component, gives a best-fitting break time threak,2 = 
3.72 + 0.07 X 10*s (Methods), pre-break light-curve decay indices %preopt = 
0.93 + 0.02 and %prex—ray = 0.96 = 0.11, and post-break light-curve decay 
indices &postopt = 1.25 + 0.04 and o&postx—ray = 1.67 + 0.10 (uncertainties 
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Figure 1 | Linear polarization of the afterglow of GRB 121024A. a, The 
percentage of linear polarization as a function of time; b, the polarization angle. 
Red points are for GRB 1210244, green points are for the only other Swift GRB 
afterglow with extensive polarimetry, GRB 091018” (error bars are 1¢). The 
horizontal axis marks the time since trigger, normalized by the jet break time 


are 1a). The occurrence ofa light-curve break simultaneously in X-ray 
and optical wavelengths is suggestive of a jet break origin: a late-time, 
usually achromatic, light-curve break, generally associated with the finite 
opening angle of the jet that becomes apparent when the jet Lorentz 
factor decreases below a critical value. The X-ray+GROND spectral 
energy distribution is best fitted with a single power law with f = 0.88 + 
0.01 and an optical extinction Ay = 0.22 + 0.02 mag (Methods; Extended 
Data Fig. 4). The pre-break temporal and spectral indices agree with the 
standard fireball closure relations” in the situation where the synchro- 
tron cooling frequency v, and peak frequency v,,, are below both optical 
and X-ray frequencies”. 

Figure 1 shows the observed Pj;,, behaviour. Initially the source starts 
out ~5% polarized (much lower than the ~70% expected for a perfectly 
ordered magnetic field); this polarization subsequently decays to lower 
levels, while the polarization angle is remarkably constant. This shows 
that the magnetic field directions are largely random, that is, the coher- 
ence scales of the field in the blast wave are small, but their directions 
are confined to the plane of the shock (the detected polarization is attrib- 
uted to a somewhat off-axis viewing angle). In the second night of data, 
the polarization angle is markedly different, consistent (within errors) 
with a change of exactly 90° (Fig. 1). We consider this an unambiguous 
detection of the 90° angle change predicted to occur around the jet break 
time of a homogeneous jet that is not spreading sideways’. The exact 
time at which the angle change occurs is dictated by viewing angle’. The 
observed angle change shows that any ordered magnetic fields in the 
forward shock are weak, if present”. 

We acquired the Pi, measurements shown in Fig. 2 between the second 
and third Pi, datapoints. Under the assumption that during this inter- 
val no variability in P.i, is expected (the time covered is small compared 
to the time after the burst), we combine the 4 measurements together, 
and measure P.i, = 0.61% + 0.13% (see Methods). We estimate the per- 
centage of linear polarization during the circular polarimetry interval to 
be Pin ~ 4%: the afterglow shows a ratio P.ir/Plin ~ 0.15, several orders 
of magnitude above basic model predictions (~10~* at optical wave- 
lengths**; see Methods) and other measured relativistic jet sources” 
(Fig. 3). 

A high level of circular polarization can be intrinsic to the source (for 
example, the P.i, of the synchrotron emission from the source) or have 
its origin in propagation effects within the source (for example, Faraday 
conversion Pj;,, > P.i,, which is effective in a hot, relativistic medium) or 
by dust scattering effects along the line of sight. Plasma propagation effects 
within the source have been shown to be strong at long wavelengths (close 
to the synchrotron self-absorption frequency, v,), but negligible at optical 
wavelengths**: it is very unlikely that these effects play a role here. 

The influence of dust is limited to the host galaxy, as the Galactic extinc- 
tion towards GRB 121024A is very small, E(B — V) = 0.10 (Methods). 
Dust affects optical P.;, through four possible routes: multiple scatter- 
ing in an optically thick medium of dust grains; dichroic scattering by 
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t/ toreak,2 


(toreak2 = 3-72 + 0.07 X 10*s). 0; and 0 show the average angles before the jet 
break; 0, . + 90 demonstrates the 90° angle change predicted for jet breaks 
of uniform jets. Horizontal double-headed arrows show the timespan over 
which the circular polarimetry was obtained. 


(somewhat aligned) non-spherical dust grains; dichroic extinction of 
linearly polarized radiation by (somewhat aligned) non-spherical dust 
grains; and the scattering of linearly polarized radiation by randomly 
oriented dust particles (if the polarization is not in, or perpendicular to, 
the scattering plane). The P.i, expected to be caused by the last three 
effects depends strongly on the degree of alignment, source inclination 
and the total amount of dust involved, for which line-of-sight extinction 
may bea proxy~™*”’. The weak line-of-sight host galaxy extinction, Ay = 
0.22 + 0.02 mag (Methods), argues against multiple-scattering effects”. 
To rule out the other possibilities, we simulated the efficiency of con- 
version of P\;, > Pir through single dust scattering (that is, where each 
photon has been scattered at most once bya dust particle), finding max- 
imal conversion values at high and low scattering angle regimes of ~ 10% 
at ~100°, and ~8% at ~20°. Models based on partially aligned dust 
grains have lower efficiency. Ifa large part of P,j, is in fact due to dust, we 
expect a large fraction, if not all, of the Pj, to be caused by dust scattering 
too. The linear polarization curve (Fig. 1) shows no signs of this: we would 
not expect to see a constant angle before the jet break (at t,,eak,2), nor for 
Py, to reach near zero at any point, nor a clear 90° angle change over the 
jet break’®. In addition, light scattered at large scattering angles has not 
had time to reach us yet (dust very close to the GRB gets destroyed by the 
GRB prompt emission). After eliminating plasma propagation effects 
and dust scattering, we therefore conclude that the measured Pi, is likely 
to be largely or fully intrinsic to the afterglow. It is of some interest to 
compare this with the only other afterglow with deep measurements, 
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Figure 2 | Optical circular polarization measurements of the afterglow of 
GRB 121024A. a, Optical circular polarimetry of the afterglow of GRB 
121024A; b, optical circular polarimetry for GRB 091018" (error bars are 10). 
The horizontal axis shows the time since burst in the observer frame, and the 
vertical axis shows the circular polarization expressed in Stokes parameters V 
and I (see Methods) as V/J, in per cent. The short-dashed line indicates V/I = 0, 
the dashed line shows the measured V/I from the combined data points, and lo 
uncertainty values around the best combined data value are shown with 

solid lines. 
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Figure 3 | Optical polarimetry of quasars and GRB afterglows compared. 
Polarization measurements for GRBs 121024A and 091018 (diamond symbols) 
are shown with optical polarimetry of quasars”* (squares); error bars are lo. 
The data points are colour-coded for redshift. The few quasars with >30 
detection of optical circular polarization have high values (=20%) of linear 
polarization. GRB 121024A has the highest detected level of optical circular 
polarization, and at z = 2.298 is the highest-redshift object with a detection, 
showing the potential for GRBs as probes of possible cosmological propagation 
effects. We note that the range of redshifts means that different rest-frame 
wavelengths are being compared. 


that of GRB 091018, which showed limits’’ P.j, < 0.15% (20) and P.,/ 
Piin < 1 (Figs 1, 2 and 3). 

The origin of the optical circular polarization in the afterglow of GRB 
121024<A is puzzling. We expect intrinsic polarization from the external 
shock synchrotron emission of Por), 1 where y, is the random Lorentz 
factor of the electrons emitting the optical radiation—under the assump- 
tion of isotropic electron pitch-angle distribution and perfectly ordered 
magnetic field® (Methods). In this situation, the observed P.;, would 
imply extremely low values of y,. Furthermore, because P.i, and Pin are 
both expected to be reduced from the value for a perfectly ordered field 
by the field randomness in the same way, the expected P.ir/Phin ratio is 
the same as for a perfectly ordered field, again scaling with y;' in the 
case of an isotropic electron distribution®. Therefore, the observed polar- 
imetric behaviour poses a challenge to the long-standing assumption of 
isotropic electron pitch-angle distributions in the GRB forward shock 
afterglow. Pitch-angle anisotropy has been postulated before as a pos- 
sible explanation of GRB phenomena as varied as steep decay phases in 
X-ray light curves” and spectra of prompt emission”’, but evidence for 
this has been lacking. Further hints of a more complicated structure of 
emission and acceleration regions come from observations as varied as 
high-energy emission in GRBs” and fast variability of high-energy emis- 
sion in quasars*’. The circular polarization of afterglows from GRBs 
as well as quasars (Fig. 3) offers a new line of evidence required to 
guide theoretical studies”””. 


METHODS SUMMARY 


Both linear and circular polarimetry of the afterglow of GRB 121024A were per- 
formed using the FORS2 instrument on the VLT, using the R,,.cia filter in imaging 
polarimetry mode, using a Wollaston prism and half- and quarter-wavelength plates. 
We used four plate angles for linear polarimetry and two for circular polarimetry, in 
order be able to use beam-switching to reduce systematic errors, and used aperture 
photometry to measure source fluxes. Measurements from polarimetric sequences 
taken on the second night after the burst were combined to increase signal-to-noise 
ratios. We used field stars to measure the linear polarization induced by Galactic 
dust, fitting the distribution in Stokes Q and U with a two-dimensional Gaussian 
function. We monitored the optical afterglow brightness in seven photometric 
filters using the GROND instrument, we fitted the X-ray data from the Swift sat- 
ellite together with the GROND data with absorbed power laws to establish the pre- 
sence of a late-time break in the light curve, and we fitted the X-ray/optical spectral 
energy distribution of the afterglow. Finally, we demonstrated how the observed 
high level of circular polarization contradicts theoretical estimates, and how 
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anisotropy in the electron pitch-angle distribution may explain the observed ratio 
of optical circular to linear polarization. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 

Linear and circular polarimetry. GRB 121024A triggered the Burst Alert Tele- 
scope (BAT) on board the Swift satellite at 02:56:12 UT on 24 October 2012 (Swift 
trigger 536580)'”. We will use this trigger time as f (that is, time since burst of an 
observation is t = tops — fo) throughout. The prompt emission shows the burst prob- 
ably belongs to the class of long bursts, with a duration to) = 69 + 32s (ref. 31). An 
X-ray and optical afterglow was found by the Swift X-ray telescope (XRT) and UV- 
optical telescope (UVOT)”. The redshift, z = 2.298, was found through afterglow 
spectroscopy with the X-shooter instrument on the Very Large Telescope”. Based 
on the initial brightness of the UVOT afterglow” we activated our VLT polari- 
metry programme (programme 090.D-0789, PI Wiersema). 

The data acquisition strategy, reduction and analysis closely follows that of our 
recent paper on GRB 091018”, which in turn follows the specific recommendations 
set out for FORS polarimetry**. We summarize those methods here, and give addi- 
tional details specific to the case of GRB121024A. 

Observations in the FORS2 Rgpeciai filter started at 9.2 10° after burst. Fol- 
lowing an acquisition image (Extended Data Fig. 1), polarimetry was acquired in 
imaging polarimetry (IPOL) mode. In this instrument mode, a super-achromatic 
half- or quarter-wavelength plate is used (for linear and circular polarimetry, respec- 
tively), after which a Wollaston prism is used to split the light into two beams, the 
so-called ordinary and extraordinary beam (hereafter the o and e beam) that have 
perpendicular polarization. These beams are imaged simultaneously: a slit mask is 
used to prevent overlap of the two beams on the chip (Extended Data Fig. 2). All 
linear polarization measurements are obtained using four rotation angles (0°, 22.5°, 
45° and 67.5°) of the half-wavelength plate; for the circular polarimetry we used 
two angles (—45° and +45°) of the quarter-wavelength plate. The GRB afterglow 
was positioned in the middle of the mask opening at the default (on-axis) position 
for FORS2 observations, on chip 1. A small number of linear polarization measure- 
ments were taken with a small dither (~16 pixels in the Y direction) to eliminate 
any potential effects of bad pixels/columns; none of these problems were apparent. 
Extended Data Tables 1 and 2 list the polarimetric observations and their resulting 
measurements. 

Data reduction was done within IRAF, using bias and sky flat frames obtained 
on the same nights. Analysis of the data was done via aperture photometry, using 
IRAE scripts developed for this purpose, using the following method. We perform 
aperture photometry of all point sources present on both FORS2 chip 1 and 2, on 
the o and e images, using an aperture radius of 1.5 times the on-frame full-width at 
half-maximum (FWHM) of the point spread function (PSF). The PSF is found via 
a Gaussian fit on all point source objects, on a per-image basis, and is determined 
independently for the o and e beam as small differences in PSF shape may occur 
between the beams, particularly for objects far off-axis. The PSF value was used to 
set the aperture size; for example, the o beam on chip 1 is the weighted average of 
the FWHM ofall point sources in that beam, chip and image. The sky subtraction 
was done using an annulus of inner and outer radii 3 and 4 times the FWHM, res- 
pectively. We only included sources for which the sky annulus was fully contained 
within the mask and showed no signs of saturation. Using this procedure we mea- 
sured fluxes f, and f, for all point-like objects in all frames. Errors were determined 
USING OP ounce =K | Xf + (na +n /ng) o%/pixel, where gis the gain, f the flux (f, 
and f., source minus the mean background) in the aperture, m4 and nz the number 
of pixels in aperture and background region; and o/pixel the variance per pixel in 
the sky level within the annulus. 

The read noise contribution to the errors is negligible in these data. Images are 
not combined (that is, the depth of coverage per pixel is 1). The purpose of using the 
four angles for linear polarization, and two for the circular, is that the beams switch, 
eliminating several systematic errors (for example, flat field defects), resulting in 
increased reliability’. We use the fluxes to derive the normalized flux difference at 
angle number ias F; = (foi — fe)/(fo,i + foi) = (foi — fei /I. We use the Stokes para- 
meters (U, Q, V, I) to describe the source polarization state, often in normalized form 
(U/L, Q/I, V/I), These can be expressed in terms of F;as Q/I= 7 x, F; cos(in/2) 
and U/I= . PS F; sin(in/2), where N is the number of half-wavelength plate 
positions (four Positions in this case)'”*°. The circular-polarization Stokes para- 
meter V/I is similarly computed as V/I = i (F4s — F_45). The benefit of the mul- 
tiple angle observations is clear from these equations. Further increasing the accuracy 
by observing at even more angles (for example, eight angles for linear polarization) 
is difficult for sources as faint as the afterglow of GRB 121024A—it would lead to 
observing times for single data points that are close to the polarization variability 
timescale. 

We now proceed to correct for the linear polarization induced by dust in our 
own Galaxy which, along the line of sight to GRB 121024A, has E(B — V) = 0.10 
(ref. 34). Using the relation Pj, = 0.09 X E(B — V) (ref. 35) this corresponds to a 
maximum induced polarization of ~0.9%, which may bea noticeable fraction of the 
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detected polarization. To correct for Galactic-dust-induced polarization, we use 
the same methods as we used for GRB 091018 (ref. 17): we fit the field star Q, U dis- 
tribution with a two-dimensional Gaussian function. We use three cuts on the 
sources entering the distribution: we require a polarimetric error <0.9%, a polar- 
ization value below 1.5% and a radial distance to the GRB position smaller than 
2.5’. This last cut is to counter the effects of instrumental polarization: the FORS2 
instrument shows a broadly radial instrumental linear polarization pattern, with 
nil polarization on-axis (where the GRB is positioned), and values increasing with 
radial distance****”’. By picking this cut-off we prevent the instrumental pattern from 
influencing the field objects’ Q, U centroid determination too much, while still re- 
taining enough sources for a reliable fit. We note that the number of bright field point 
sources within the mask is low in the case of GRB 121024A (87 data points from 9 
individual stars enter the fit, considerably lower than in the case of GRB 091018; 
ref, 17), resulting in a somewhat larger uncertainty in the Galactic-dust-induced Q, 
Uvalue. We find a value of Qgai = — 0.0020, Ugal = — 0.0046. We use the standard 
deviation of the fitted 2D Gaussian as a measure for the uncertainty on the Galactic- 
dust-induced Stokes parameters, finding 7g, = 0.0040, oy, = 0.0043. We correct 
the afterglow Q, U values by subtracting the Galactic-dust-induced Q, Uvalues. To 
facilitate comparison with models, we use the Stokes parameters to express the linear 
polarization in terms of the polarization degree Pj, and polarization angle 0 as 


1 
Pin = V/ (Q?+ U?)/Land 0= ; arctan(U/Q), where coordinates are chosen such 


that 0 = 0° for North, 0 = 90° for East. In the conversion from Q, U to Pi, 0 we ac- 
count for the effects of polarization bias**”’ in the same manner as done for the data 
of GRB 091018'”””. We correct the angles 0 for the FORS2 instrumental zero angle 


offset’”. Similarly we can compute Par = \/ (V/I)’: the sign of V/I gives the polar- 
ization direction (clockwise or anticlockwise). We note that both the instrument- 
induced circular polarization and linear to circular polarization crosstalk are well 
studied for the FORS instruments and far below our detection levels on the optical 
axis (where the afterglow is positioned)*****’, and as such we expect no instru- 
mental contribution to the detected V/I. A small amount of instrumental circular 
polarization is expected to be present for sources observed far off-axis**”’, far from 
the GRB position. Since there are no bright (but unsaturated) stars near the GRB 
position, it is therefore not possible to use field sources as secure independent sec- 
ondary standards****”’, The circular polarization induced by scattering by the dust 
in our own Galaxy, with E(B — V) = 0.10 and the induced linear polarization values 
as above, does not contribute significantly to the observed Pi, (ref. 40). 

In the second night of data, the afterglow of GRB 121024A shows a low level of 

linear polarization, which means that the uncertainty in the angle 0 is relatively large, 
as 69 = Op, /2P\in. Nevertheless a clear difference in angle is visible with respect to 
the first night of data, consistent with a 90° change in polarization angle: the angle 
changes from 172° + 2° to 85° + 10° (Extended Data Table 1). The two first obser- 
vations have angles somewhat discrepant from the pre-jet break average (Extended 
Data Table 1). Unfortunately we were not able to acquire infrared polarimetry simul- 
taneous with the R-band polarimetry for scheduling reasons—simultaneous multi- 
wavelength polarimetry is the best way to directly measure polarization induced by 
scattering off dust particles in the host galaxy***°*". However, the measured line-of- 
sight extinction and indeed the linear polarization light curve itself provide sufficient 
evidence that the host galaxy-dust-induced linear polarization must be small: the 
measured Ay ~ 0.22 mag gives a limit on the induced linear polarization of <0.7% 
(assuming a Milky Way-like polarization curve”). 
Light curves and spectral energy distribution. The Swift XRT light curve is well 
described by three power-law segments, with parameters (errors at 90% confidence 
level) 0) =1.78 7043, threak,1 =619 "3498, 2 =0.837053, fbreak,2 = 3.4133 X 104s 
and «3 = 1.70703” (ref. 20). The uncertainty in the late break time is relatively large, 
as there are few data points post-break. 

The optical afterglow of GRB 121024A was observed extensively with the seven 
channel (g’, 7’, i’, 2’, J, H, Ks filters) Gamma-Ray Burst Optical and Near-Infrared 
Detector (GROND”’) instrument, mounted on the ESO 2.2-m telescope at La Silla, 
Chile, providing photometric monitoring simultaneous to the polarimetry (Extended 
Data Table 3). These GROND observations will be described in more detail in a forth- 
coming, separate publication, but below we give some properties that are required to 
interpret the polarimetry. In addition to the GROND data, we observed the posi- 
tion of GRB 121024A with the ACAM instrument on the William Herschel Tele- 
scope (WHT) on 15 February 2013 (that is, 114.8 days after burst), with the aim of 
fixing the flux contribution of the host galaxy to the late-time light curve. We acquired 
14 X 180s exposures in Sloan r filter in relatively poor seeing conditions (~ 1.8"). The 
resulting image (Extended Data Fig. 1) shows a clear detection of a source at the 
afterglow position, which we identify as the host galaxy of this GRB, confirming that 
a late-time detection in GROND data in g, r, i is dominated by the host. Its bright- 
ness, r’ = 24.03 + 0.20, places the host at the bright end of the host galaxy lumin- 
osity distribution at this redshift*’. Taking this bright host galaxy magnitude into 
account, a light-curve break at around the same time as the break in XRT data is 
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apparent (Extended Data Fig. 3). We perform a joint fit to the XRT and GROND 
light curves (where only XRT data after the first XRT light-curve break, fhreak,1 are 
used) to constrain the break time. We fit using a model that consists of a smoothly 
broken power law, generally defined as 


t as t a%s\ —1/s 
RO=Fltea) (5) +5) | 
toreak bbreak 


where fpreak is the break time, « and % are pre- and post-break light-curve indices, 
and s is the break smoothness parameter. The pre- and post-break slopes and break 
smoothness are free parameters in our fit, and the break time is fixed to be the same 
for X-ray and optical/infrared wavelengths (that is, an achromatic break, and in 
addition a host galaxy contribution to the optical and infrared fluxes). This results 
in an acceptable fit statistic (reduced ° = 157.38/132 = 1.192), and the following 
parameters (uncertainties are 1c): break time tyreak,2 = 3.72 + 0.07 X 10*s and break 
smoothness s = 5.01 + 0.01; pre-break light-curve decay indices %preopt = 0.93 + 0.02, 
OpreX-ray = 0.96 + 0.11; post-break light-curve decay indices opostopt = 1.25 + 0.04, 
OpostX-ray = 1.67 + 0.10. We identify this late, achromatic, break fyreax.2 with a so- 
called jet break. The resulting fit is shown in Extended Data Fig. 3. While in each 
optical band there are only a few data points post-break, GROND observes in seven 
bands simultaneously, making the break significant. The relatively shallow post- 
break optical decay is probably caused by the combination of bright host and 
smooth break: by the time the light curve asymptotes to its post-break index it is 
dominated by host galaxy light. 

The line-of-sight extinction in the host galaxy and the spectral slopes are found 
by fitting an XRT +GROND spectral energy distribution (at time 11,085 s after trigger) 
with a SMC-like extinction law”. The best fit, with reduced ¢ = 1.04, is obtained 
bya single power law (a broken power law does not result in significant fit improve- 
ment) with parameter / = 0.88 + 0.01, and a optical extinction in the V band of 
Ay = 0.22 + 0.02 mag. 

The fact that X-ray and optical/infrared wavelengths have the same spectral index 
and that the pre-break decay indices are (within errors) identical, suggests that X-ray 
and optical are both in the same spectral regime, likely v > v,. The achromatic nature 
of the light-curve break is consistent with a jet break interpretation, supporting 
our interpretation of the linear polarization behaviour of this afterglow. 
Circular polarization of synchrotron emission. The linear and circular polariza- 
tion degrees of the optically thin synchrotron emission from the electrons with a 
spectrum given by N(y,,”) =Kyz ’f(«), wherey, is the electron Lorentz factor, « is 
the electron pitch angle and K is the normalization factor, are given by''* 
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Here 0 is the viewing angle with respect to the magnetic field direction, I" is the 
mathematical Gamma function, and we have defined 


_1 df(#) 
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These two equations are valid when g(0)<j,. 

If the electron pitch-angle distribution is isotropic, that is, (0) = 0, then Par ~y, }. 
This simply means that the circular polarization contributions of electrons with 
pitch angles 0 + « and 0 — « nearly cancel out, and the remaining P,;, scales with 
the angular size of the beaming cone of the synchrotron emission, y; !. 


g(= 


“4=0 


The electrons with Lorentz factor y, mainly contribute to the synchrotron 
| ere 
2nm,c) °° 1+z 
and the blast wave Lorentz factor J’ can be estimated by the standard external 
shock model’’. Therefore, by calculating the Lorentz factor of the electrons pro- 


ducing the R-band emission, one can predict P.;, at the observing time as 


emission at frequency v= where the magnetic field strength B 


Peir™10~ 405" Bah’ nl/® 


where &3 =0.01ég —» is the fraction of the magnetic energy density to the internal 
energy density, E = 10°°Es» (erg) is the total blast wave energy and n (cm 3) is the 
circumburst particle number density. This value does not strongly depend on the 
model parameters, and is very low (in spite of the assumption that the magnetic 
field is ordered) compared to the observed value of P.ir = 0.61 + 0.13%. In reality, 
the magnetic field directions are largely random, as implied by the observed Piin 
light curve. However, the linear and circular polarization degrees are reduced to the 
same extent by the randomness of the field, so the ratio P.i-/Piin ~ 10 * is applic- 
able also for the random field case, which is clearly inconsistent with the observed 
value Pcir/ Phin = 0.15. 

Ina situation where the pitch-angle distribution is not isotropic, the circular polar- 
ization contributions of electrons are not cancelled out and P,i, can be higher. The 
observed polarization ratio P.;,/Piin implies that g(0)/y, ~ 0.1 and then g(0) ~ 10°, 
which means a highly anisotropic pitch-angle distribution. 

The detection of high circular polarization implies that the emitting plasma con- 
sists mainly of electrons and protons, rather than electrons and positrons, because 
the circular polarizations of the synchrotron emission of electrons and positrons 
perfectly cancel out*’. This implication is consistent with the emission model of the 
forward shock propagating in the circumburst medium. 


31. Barthelmy, S. D. etal. GRB 121024A: Swift-BAT refined analysis. GCN Circ. 13889 
(2012). 

32. Tanvir, N.R. etal, GRB121024A: VLT/X-shooter redshift. GCN Circ. 13890 (2012). 

33. Patat, F.& Romaniello, M. Error analysis for dual-beam optical linear polarimetry. 
Publ. Astron. Soc. Pacif. 118, 146-161 (2006). 

34. Schlegel, D. J., Finkbeiner, D. P. & Davis, M. Maps of dust infrared emission for use 
in estimation of reddening and cosmic microwave background radiation 
foregrounds. Astrophys. J. 500, 525-553 (1998). 

35. Serkowski, K., Matheson, D. S. & Ford, V. L. Wavelength dependence of interstellar 
polarisation and ratio of total to selective extinction. Astrophys. J. 196, 261-290 
(1975). 

36. Bagnulo, S., Szeifert, T., Wade, G. A., Landstreet, J. D. & Mathys, G. Measuring 
magnetic fields of early-type stars with FORS1 at the VLT. Astron. Astrophys. 389, 
191-201 (2002). 

37. FORS Users Manual Issue 91.1 (ESO, Doc. no. VLT-MAN-ESO-13100-1543, 
2012); available at http://www.eso.org/sci/facilities/paranal/instruments/fors/ 
doc.html. 

38. Wardle, J. F.C. & Kronberg, P. P. The linear polarisation of quasi-stellar radio 
sources at 3.71 and 11.1 centimeters. Astrophys. J. 194, 249-255 (1974). 

39. Simmons, J. F.L. & Stewart, B. G. Point and interval estimation of the true unbiased 
degree of linear polarisation in the presence of low signal-to-noise ratios. Astron. 
Astrophys. 142, 100-106 (1985). 

40. Martin, P. G. Interstellar circular polarisation. Mon. Not. R. Astron. Soc. 159, 
179-190 (1972). 

41. Klose, S. et al. Prospects for multiwavelength polarisation observations of GRB 
afterglows and the case GRB 030329. Astron. Astrophys. 420, 899-903 (2004). 

42. Greiner, J. et al GROND — a 7-channel imager. Publ. Astron. Soc. Pacif. 120, 
405-424 (2008). 

43. Hjorth, J. et al. The optically unbiased gamma-ray burst host (TOUGH) survey. 
|. Survey design and catalogs. Astrophys. J. 756, 187-202 (2012). 

44. Pei, Y.C. Interstellar dust from the Milky Way to the Magellanic Clouds. Astrophys. J. 
395, 130-139 (1992). 

45. Melrose, D. B. Non-thermal Processes in Diffuse Magnetised Plasmas Vol. 1 (Gordon 
& Breach, 1980). 


©2014 Macmillan Publishers Limited. All rights reserved 


Extended Data Figure 1 | Host galaxy and afterglow image. Left, VLT lines. Right, detection of the host galaxy in the late-time WHT ACAM r-band 
FORS2 Rgpecia-band acquisition image, with the afterglow indicated by broken _ imaging. 
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Extended Data Figure 2 | Polarimetry mask and afterglow brightness. the sky that falls into the open part of the rectangular mask is imaged twice, in 
A small section of a single FORS2 Rgpecia-band polarimetric exposure (this is perpendicular polarizations, the o and e beams, which is why the same objects 
the —45° angle chip 1 frame of the cir4 set), illustrating the shape ofthe aperture _ each time appear in two non-overlapping strips. 

mask and brightness of the afterglow (indicated by an arrow). Each part of 
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Extended Data Figure 3 | Optical and X-ray afterglow light curves. a, Swift top right). Overplotted in a and c is the best-fitting smoothly broken power law 
XRT X-ray light curve in the time span covered by GROND observations; b, the | (Methods), with a host galaxy contribution to the optical data. Residuals of this 
full XRT light curve. c, Full GROND light curves in all seven bands (key at fit are shown in d. 
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shows the best-fitting power law without the effects of reddening and X-ray 
absorption. The horizontal error bars on the optical and X-ray data show the 
filter throughput and spectral bin-size, respectively. Vertical error bars show lo 
uncertainties on the fluxes. 


Extended Data Figure 4 | X-ray/optical spectral energy distribution of the 
afterglow of GRB 121024A. Shown isa spectral energy distribution using the 
seven GROND photometric bands and simultaneous Swift XRT X-ray data. 
The overplotted solid line is the best-fitting absorbed power law; the dashed line 
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Extended Data Table 1 | Linear polarimetry results 


Datapoint Time since to Pia 6 Exposure time 
identifier (mid; s) (%) (degrees) 

linl 9698 4.09+0.20 163.7+2.8 4x3 min 

lin2 10638 4.8340.20 160.34+2.3 4x3 min 

lin3 14782 3.82 + 0.20 182.7 + 3.0 4x4min 

lin4 16042 3.12+0.19 175.3+3.5 4x5 min 

lind 17425 3.39+0.18 178.0+2.9 4x5 min 

lin6 18841 3.49 + 0.18 180.3 + 3.0 4x5 min 

lin7 20227 3.20+0.18 174.543.3 4x5 min 

lin8 91624 0.3441.09 51.9+67.5 3 x (4x 5) min 
lin9 95828 1.49+0.78 93.14 26.6 3 x (4x 5) min 
linlO 103028 2.66+40.60 83.0+12.6 2x (4x 5) min 
linl1 105814 0.86 + 0.72 101.334+36.4 2x (4x5) min 


LETTER 


The polarization angle 0 follows a standard coordinate convention: north = 0°, east = 90°. The values are corrected for Galactic-dust-induced polarization in Stokes parameter space, and polarization bias 
corrections are performed. Note that for the lin8-lin11 datapoints, we combine multiple exposure sets together: the polarization is low and the source faint. 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


Extended Data Table 2 | Circular polarimetry results 


Datapoint Time sincetg V/I 


identifier (mid; s) (x 100) 

cirl 11643 0.87 + 0.21 
cir2 12373 0.30 + 0.21 
cir3 13103 0.53 + 0.22 
cir4 13832 0.75 + 0.23 


Each datapoint consists of two exposures (—45° and +45° angles) of 5-min exposure time each. The 
sign of V//is positive for all four data points (the sign distinguishes clockwise and anticlockwise circular 
polarization direction), and the circular polarization P.ir in per cent is therefore equal to the values in the 
third column. Uncertainties are lo. 
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Extended Data Table 3 | GROND optical and near-infrared photometry of the afterglow 


7 


7 


7 


7 


Time since to (mid; s)  g 7 4 z 

11085.19 20.73+0.08 19.830+0.07 19.5140.07 19.25+0.07 
15496.95 21.09+0.05 20.21+0.07 19.84+0.06 19.63+0.07 
17006.44 21.22+0.05 20.31+0.07 19.94+0.06 19.68+0.07 
21430.12 21.45+0.27 20.54+0.07 20.20+0.07 19.93 + 0.09 
106998.09 22.79+0.24 21.99+0.10 21.70+0.12 21.48+0.14 
191934.13 23.3140.11 22.50+0.09 22.1340.10 21.8640.13 
2432398.02 24.24 + 0.08 24.20+0.09 23.80+0.14 24.16+0.15 
Time since tp (mid; s) J H Ks 

11085.19 18.67+0.08 18.33+0.09 17.96+0.11 

15496.95 19.03+0.07 18.70+0.08 18.33+40.10 

17006.44 19.07+0.07 18.69+0.08 18.56+0.10 

21430.12 19.42+0.08 18.90+0.10 18.86+0.13 

106998.09 20.99+0.25 20.68+0.30 20.29+0.3 

191934.13 > 22.03 > 21.35 > 20.53 

2432398.02 > 22.50 > 21.92 > 21.02 


Magnitudes are as observed; uncertainties are 1a; upper limits are 3c. 
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Classical shear cracks drive the onset of dry frictional 


motion 


Ilya Svetlizky!' & Jay Fineberg! 


Frictional processes entail the rupture’” of the ensemble of discrete 
contacts defining a frictional interface**. There are a variety of views 
on how best to describe the onset of dry frictional motion. These 
range from modelling friction with a single degree of freedom, 
a ‘friction coefficient’**, to theoretical treatments using dynamic 
fracture** to account for spatial and temporal dynamics along the 
interface. We investigated the onset of dry frictional motion by per- 
forming simultaneous high-speed measurements of the real contact 
area and the strain fields in the region surrounding propagating 
rupture tips within the dry (nominally flat) rough interfaces formed 
by brittle polymer blocks. Here we show that the transition from 
‘static to ‘dynamic’ friction is quantitatively described by classical 
singular solutions for the motion ofa rapid shear crack*”"’. We find 
that these singular solutions, originally derived to describe brittle 
fracture, are in excellent agreement with the experiments for slow 
propagation, whereas some significant discrepancies arise as the 
rupture velocity approaches the Rayleigh wave speed. In addition, 
the energy dissipated in the fracture of the contacts remains nearly 
constant throughout the entire range in which the rupture velocity 
is less than the Rayleigh wave speed, whereas the size of the dissip- 
ative zone undergoes a Lorentz-like contraction as the rupture velo- 
city approaches the Rayleigh wave speed. This coupling between 
friction and fracture is critical to our fundamental understanding 
of frictional motion and related processes, such as earthquake 
dynamics. 

A dry frictional interface is composed of an ensemble of discrete 
contacts whose total area (the real contact area, A) is orders of magnitude 
smaller than the nominal contact area*. Local motion (slip) is initiated 
when contacts are broken via propagating ruptures’. Characterization 
of the dynamic fields that drive these ruptures and how they couple to 
the dissipative mechanisms on the interface are therefore critical to 
our fundamental understanding of frictional motion. These fields also 
describe the elastic radiation emitted by the rupture whose form is 
important to interpretation of near-field seismic signals radiated by 
earthquakes'*”’. 

Linear elastic fracture mechanics’° (LEFM) is the fundamental theor- 
etical framework describing brittle fracture. LEFM predicts a universal 
singular form of stresses (and strains) in the vicinity of a crack tip. These 
singular stresses are only regularized by dissipative’* processes that take 
place as this singular limit is approached. Shear-driven (mode II) fracture 
is generally considered to be impossible in bulk isotropic materials’ 
because it is believed that a crack will rotate under imposed shear so 
as to fracture under pure tension. An exception to this may occur for 
highly anisotropic conditions, as in the case of the weak interface plane 
that defines a frictional interface’*'*"'*. Examples of such systems range 
from natural faults to single adhesive contacts. Although both contact 
separation® and earthquake dynamics*”!*"*"* have long been theoret- 
ically described using fracture mechanics, there has been no direct 
experimental evidence that quantitative predictions of LEFM, such as 
the universal singular forms of stresses and material motion at tips of 
rapidly moving ruptures, really occur in frictional failure. We will dem- 
onstrate that, in the brittle material studied, frictional ruptures along dry 


and (nominally flat) rough interfaces can indeed be quantitatively 
described by LEFM. 

We study the structure of the elastic fields driving ruptures that prop- 
agate within an interface formed between two initially flat poly(methyl- 
methacrylate) blocks, which were uniformly roughened to a 3-m r.m.s. 
roughness. The blocks are pressed together with an externally imposed 
normal force, Fy = 5,500 N (5 MPa of nominal pressure). Once Fy is 
applied, a shear force, Fs, is quasi-statically incremented until stick-slip 
motion initiates (Fig. 1a). The complete two-dimensional strain tensor, 
é,(t), is measured at 19 locations along and ~3.5 mm above the interface 
at 1,000,000 samples per second (Fig. 1a). We simultaneously measure 
the displacement parallel to the interface, u,, at specific locations adja- 
cent to the strain gages. At the same time, we measure the real contact 
area, A(x,f), with high spatial resolution at 580,000 frames per second 
(see Methods). We focus on the short time intervals bracketing slip 
initiation. 


0.5 1 1.5 -40 -20 0 20 40 
t (ms) X — Xj, (mm) 
Figure 1 | Experimental system and slow rupture scaling. a, 19 Rosette strain 
gauges (blue squares) are mounted 3.5 mm above the frictional interface. The 
three strain components were measured simultaneously every 1 pls. b, The 
evolution of real contact area, A(x,f) (normalized at nucleation time, t = 0), 
along the 200-mm quasi-one-dimensional interface of a slow rupture front, 
which nucleated at x = 0 and accelerated to 0.3CR. ¢, A(f) (top) and i,(t) 
(bottom) (u, is the x component of the displacement field measured 3.5 mm 
above the interface) at locations x, (blue dots) and x, (red dots) denoted in b. 
d, Collapse of both A(x,t) (top) and i, /C; (bottom) is obtained by plotting the 
data as a function of the distance x — x, when the rupture tip, xp, arrives at x, 
(blue, at time f,) and x, (red, at time t,). Cris the local front velocity. A(x,t;) was 
measured directly whereas the spatially dependent u(x) was constructed from 
the time series u,(f) using t,(x,t) =u,(x — Crt). Variations of C; (x) are 
accounted for as described in Methods. During the front’s passage, — tt, /Cr 
corresponds to changes of the strain component parallel to the interface, Ag... 
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We define the location of a rupture front, x;jp, as the point where the 
initial value of A(x,t) is reduced by 3%. x,;, both represents the fracture 
of contacts and signals the initiation of local slip’. For a steadily moving 
front, &;(x,t) = &j(« — Ce), where Cris the rupture front velocity. When 
Cevaries slowly, g;(x,t) = ey(x — —|C; dt), enabling conversion from tem- 
poral é;(t) to spatial measurements é;(x — xp), using Ct) obtained 
from Alot ) measurements (see Methods), For each front passage we 
define strain tensor variations, Aéj, by subtracting the initial strains 
from é,, and ¢,, and residual strain from é,,. 

We present a typical measurement of A(x,t) for a slow, but accelerat- 
ing, rupture front in Fig. 1b. C; is well below the Rayleigh wave speed, 
Cr (~1,255 ms’) over the entire duration of this experiment. As Cr 
increases, the timescale of the weakening process (drop of A(x,t)) short- 
ens (Fig. 1c, top) and material element velocities, 11,.(u is the displace- 
ment field), increase significantly (Fig. 1c, bottom). In contrast to A(x,t), 
which is an interfacial property, , constitutes a lower bound of the slip 
velocity at the interface, because it is measured 3.5mm from the inter- 
face (see Methods). When plotted in the reference frame of the moving 
rupture tip, x — Xp the A(x,f) collapse (Fig. 1d, top) to a single curve. 
Furthermore, u(x), when scaled by C; collapses to a well-defined func- 
tional form (Fig. 1d, bottom) corresponding to Aé. = — t,(x)/C¢ (Ex- 
tended Data Fig. 2). Note that contacts are broken for x — x43) <0 and 
unbroken when x — xtip > 0. 

Figure 2a demonstrates that for slow ruptures (~0.01Cg<C;< 
~0.3C) the measured variations of each strain tensor component, Agi, 
collapse to well-defined functional forms. This breaks down as C; in- 
creases; all Ae; amplitudes grow significantly while strain oscillations 


a b 0.15 
+ 0.96C, 
a 0 
2 
2 -0.15 
= ty) 
& 
4 0.25 
0.1 
e 
Oo 
= 
4 0 
50-25 0 25 50 
x X jp (MM) 


° 
y (mm) 

h [= ] oa 

[a_i 
5 @ © 
- wp 
ie) 
(c-01) “av 


X= Xtip (mm) 


Figure 2 | The functional form of the elastic strains. a, Measurements of 
strain tensor variations, Agj, after subtracting the initial strain from ¢,, and éyy 
and residual strain from é,,. Colours represent different slow front velocities 
(0.05Cp < Ce< 0.24Cg; see ey. Corresponding LEFM predictions (equation 
(1)) for plane strain boundary conditions are plotted in black (Cs = 0.2Cg). The 
fracture energy /’~ 1.1Jm ” is the sole free parameter. b, The mean value of 
Ag; for three typical measurements at C;~ 0.96Cg. Equation (1) is plotted 
(tack line) for the same Jas in a, and still describes the larger amplitudes and 
strong oscillations of rapidly propagating ruptures. c, Shear strain variations, 
Aé,,, surrounding the rupture tip predicted by equation (1) for Cp = 0.96Cp. 
The blue square denotes the strain gauge location relative to the approaching 
rupture tip. Note the strong angular dependence that drives the violent 
oscillations in b. 
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are strongly amplified. Typical measurements of Ae, for high velocities 
(0.96CR) are presented in Fig. 2b. 

We now compare the measured strains, ¢;, to the universal asymp- 
totic solutions for mode II cracks predicted by LEFM. In our experi- 
ment, external tensile stresses (O2xF yy ) exist and the crack faces are 


subjected to a residual frictional shear stress, tT’, so that the stresses can 
r 


oO 
be written as 0, =Aoj+ = . Owing to the linearity of the 


yy 
governing equations, the stress field variations, Ac, can be mapped" 


to the stress-free conditions that define the mode II crack problem. 
Therefore, near the rupture tip, Ao, have the singular form: 


Ky 


Aoi = Xj;(0,Cs) (1) 


where (7,0) are polar coordinates relative to the crack tip, Z1' i, Cr) is 


a known universal function and Ky is a scalar coefficient lace as the 
stress intensity factor’. Ao; are linearly related to the measured strain 
variations Aé; via the elastic moduli and the use of plane strain bound- 
ary conditions (see Methods). Because the strain gauges are displaced 
from the interface, each Ag,(x) measurement involves both radial 
(50 mm > r> 3.5 mm) and angular (0 <0 <1) variations (see Fig. 2c). 

In Fig. 2a we show that for slow ruptures all of the measured strain 
components Ag; agree well with the strains corresponding to equation 
(1) (black fies), where the only free parameter is Ky. The apparent 
data collapse in Fig. 2a occurs because these LEFM solutions for 
Ce< 0.4Cg are indistinguishable. Ky is related to G, the energy flux 
per unit area (the energy release rate), via"®: 


ge v’) 


Siu(CK (2) 


where f(C¢) is a known apes function that is fairly constant for 
low velocities and diverges as C— Ca. When G is balanced by the 
fracture energy I (the energy dissipated per unit area), equation (2) 
uniquely relates Ky for each Crto I. Using equation (1) to measure Ky, 
the energy balance condition (G = I”) therefore yields a single value 
I'~1.1+0.3Jm ’ forthe LEFM predictions presented in Fig. 2a over 
0.01CR = Ce< ~0.3Cp. 

A typical example of rupture at high C; (Fig. 2b) demonstrates that 
for the same constant value of J’ most characteristic features of Ag, 
observed at higher rupture velocities are also well described by equa- 
tion (1). For example, the violent high-amplitude strain oscillations 
that occur when the rupture tip passes beneath the measurement point 
(Fig. 2c) are due to the singular nature of Ej ACD Cr). Figure 2b, top, 
however, demonstrates that equation (1) clearly fails to describe Aé,, 
for x — Xtip > 0. 

To systematically study how Ag; varies over 0.01 < C;< 0.99Cg, we 
characterize Ag; variations using Sis SEyyy Ob xy and 580, as defined in 
the insets within Fig. 3. With the only input being the constant value of 
I measured at low velocities, we see that, for all Cz the theory quanti- 
tatively describes both the strain variations 6¢,,. and de,, (Fig. 3a) and 
the violent high-amplitude strain oscillations, ban, “Fig, 3b, top) 
evident in the example in Fig. 2b. 

Although 6¢,,, is also well-described by the theory for C)/Cg < 0.4, 
we find (Fig. 3b, bottom) that a systematic failure of the classic univer- 
sal form predicted by LEFM (equation (1)) occurs, as seen in Fig. 2b, 
top. These deviations are first observed at Ce~0.7C and increase 
dramatically as C;—> Cp; experiments reveal a sharp and systematic 
increase of 6é,), whereas the theory predicts vanishingly small values as 
Cr—> Cp. Note that for C7/Cp < 0.4 the apparent collapse of all strain 
components (Fig. 2a) is a natural consequence of the theory. 

We now consider the value of I. Whereas the bulk fracture energy, 
Tou = 2,000J m7’, is a material property of poly(methylmethacry- 
late), the interface value, I’, is directly related to A(x,t), which is deter- 
mined by Fy. It is interesting that the measured value 7 = 1.1Jm 7 for 
our experimental conditions is approximately I), when the sparse- 
ness of the contacts* (A < Apominal) is accounted for (see Methods). 
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Figure 3 | Systematic comparison of measured strains to LEFM. 

a, Measured elastic strains, ¢,,. (top, inset) and ¢,, (bottom, inset), are 
characterized by their signal amplitudes, de, and 6é,, (black arrows). Main 
panels: 5¢,, (top) and d¢,,, (bottom) are plotted for 0.01Cp < Ce< 0.99CR. 


yy 
b, We characterize Exy by the oscillatory amplitude, bene (top, inset) and the 


shear strain drop, SExy (bottom, inset). a, b, Cp = 1,255+10ms °, plotted 


This suggests that significant plastic deformation (the major contri- 
bution to [},y,) may also take place within the contacting asperities, as 
surmised in ref. 2. 

Although LEFM enables us to measure J, it does not address the 
details of the dissipative region. As A(x,f) is measured on the interface, 
it characterizes cohesive zone (interface weakening region) properties. 
Figure 4a reveals that the contact area reduction, AA, increases with C,, 
suggesting that corrections to our assumption of constant J” may be 
required at high velocities. J’ could indeed vary significantly in the 
singular (Cp~ Cp) region of Ae, while still being consistent with the 
data, as when approaching the singularity our finite (+10ms_') mea- 
surement accuracy of C; and Cp limits our ability to resolve I. 

Although the non-monotonic behaviour of A with x — xp suggests 
interesting dynamics as Cs— Ca, for reasons of simplicity we char- 
acterize the scale over which A(x,t) is reduced by single length scale, X, 
(Fig. 4a, bottom). We suggest that X. provides a direct measure of the 
cohesive zone size. Figure 4b demonstrates that X, is not constant, but 
systematically contracts with increasing C;. As Cs— Cp, X, appears to 
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Figure 4 | ‘Lorentz’ contraction of the cohesive zone. a, Top; the contact area 
reduction, A/Ao, increases for faster ruptures and non-monotonic behaviour is 
observed for C;>0.9Cg. Bottom, we characterize the weakening region 

(the reduction of A) by X,, the length scale where a 60% reduction of AA/Ao 
occurs. Contraction of X, with C;is apparent. b, X, as a function of Cz Colours 
represent the different slip events in Fig. 3. Black line: LEFM prediction, 

X. = X°/fn(Cp), where X°= X.(C-~ 0). X.(C= Cg) < 0.5 mm are limited by 
our temporal resolution. 
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strains are in units of 10 °. Colours represent different stick-slip sequences, 
where different measurement locations are connected by a solid line. Classic 
LEEM predictions, for constant ’°~ 1.1Jm , are plotted in black. Equation 
(1) successfully describes 8¢,,, 58), (a) and deQe (b, top) while systematically 
failing to describe 5, (b, bottom). 


approach zero. Fracture mechanics predicts such an effective ‘Lorentz 
contraction’ of all length scales in the propagation direction®'®’’. In 
particular, we expect X_ = X°/fn(C;), where fix(Cy) is the universal func- 
tion (equation (2)) relating Ky to G, which diverges at Cp and x? = 
X,(C; = 0). Figure 4b (black line) demonstrates that this general result 
of elastodynamic theory describes the measurements well. 

Figure 4b indicates that in the slow rupture regime, strain measure- 
ments were performed at scales of ~X,, which characterizes the scale 
of the cohesive zone. The universal form, predicted by equation (1), is 
expected to describe the elastic fields at distances far from the cohesive 
zone, but still small compared to other dimensions of the system". The 
good agreement with LEFM (Fig. 2a) is, nevertheless, consistent (see 
Extended Data Fig. 4) with simple cohesive zone models coupled to 
LEFM”. 

In this light, as X- contracts by nearly an order of magnitude as Cp— Cg, 
we should expect equation (1) to become increasingly more accurate in 
precisely the velocity regime where it performs badly (compare 6¢,, in 
Fig. 3b, bottom). Ob y(Cy), as Fig. 3b, inset, demonstrates, correlates with 
the initially imposed shear stresses ahead of the front. These initial stresses 
have been shown to govern rupture velocity selection’? *'. Higher order 
contributions to this LEFM solution (see Extended Data Fig. 5) do not 
correct these discrepancies. As ruptures where C;—> Cp are commonplace 
in both frictional failure and earthquake dynamics, understanding this 
failure of the classical LEFM solution is important; the form of the singular 
stress fields surrounding rapid ruptures has significant ramifications for 
earthquake branching and fault bifurcation”. 

Knowledge of /’ and X, enables us to estimate elusive but long- 
sought'*'®"'® dynamical quantities at the interface that include peak 
stresses and slip velocities. The contraction of X, tells us (see Methods) 
that quantities measured off-interface increasingly deviate from their 
interface values as Cs—> Cp; for example, the peak value (strain = 
0.8 X 10°) of 58, (= —i,/C,), which corresponds to u,~ 1 ms |, 
can be an order of magnitude higher on the interface’*. 

Whereas we have demonstrated that equations (1) and (2) provide 
an excellent description of the fields that drive frictional motion in a 
specific material under dry conditions, we expect the results to be 
generally valid as long as the assumptions underpinning the LEFM 
solution are obeyed. These are: (1) the dissipative scale is small com- 
pared to the size of the elastic region in which the stress singularity 
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develops; and (2) the residual stress, t', remains fairly constant. So long 
as the dissipative region is sufficiently confined, its precise nature (for 
example, whether granular, powdered or consisting of contacting aspe- 
rities) only contributes to the value of I'(C;,). This description may have 
to be adapted for lubricated boundaries where t’ may be strongly slip- 
rate dependent’, or even break down within highly damaged regions 
whose spatial extent is large compared to the surrounding elastic 
material. 

The measurements in Figs 2 and 3 may therefore provide a precise 
quantitative framework for describing frictional motion along rough, 
nominally flat, surfaces. Such high-speed strain measurements” are 
rarely performed adjacent to natural faults. It is interesting to consider 
if the universal description of strains observed here is also valid during 
earthquakes. If so, high-speed single-point measurements of Ag, adja- 
cent to faults could provide sensitive measurements of both earthquake 
velocities and J. These measurements could reveal rupture-velocity 
dependence of both intrinsic values of J’ and values resulting from 
near-fault damage. These measurements could also clearly differentiate 
between different types of fracture modes such as supershear’ or slow 
ruptures, which are increasingly observed in both experiments’*** and 
natural faults”*. Our results indicate that the structure of slow ruptures is 
characterized by LEFM. The mechanisms governing their appearance 
are, however, not yet clear and may depend on properties of the under- 
lying friction law (for example, a velocity strengthening range)*””*. 

Let us now consider the threshold for frictional failure. Recent stud- 
ies'***° have shown that the static friction coefficient is not a material 
property, but depends on the applied loading. The failure of bulk ma- 
terials is described by the Griffith condition—the critical energy release 
rate, Goi, being equal to J. We have shown that interface rupture pro- 
pagation is described by, essentially, the same framework. This suggests 
that an analogous ‘Griffith’ condition may exist for the onset of rupture 
propagation for a frictional interface, and by incorporating applied 
loading conditions it may be possible to calculate Geir. 


METHODS SUMMARY 


The Methods section contains additional material about sample preparation and 
characterization together with detailed descriptions of the loading system, real 
contact area measurements, slip and strain measurements. Also presented is a 
comparison between direct measurements of the material velocities and derived 
values obtained using the relation 62,. = — it, /C;. More detailed descriptions of 
the analysis techniques (comparison of LEFM to measured strains, converting 
temporal to spatial measurements) used in the manuscript are also presented. 
Included is additional information that demonstrates that both higher-order 
(non-singular) strain field terms and the existence of a (simple) cohesive zone 
of size X_ have negligible effects on the comparisons of equation (1) to the data 
measured in our experiments. We additionally include a brief section that shows 
how estimates of cohesive zone properties (peak stress, interface slip velocity, and 
weakening distance on the interface) can be obtained from our measurements of I” 
and X.. We also describe how we relate the measured value of I” to measured bulk 
values, I pun of the fracture energy. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 

Sample construction. Our experiments were conducted using poly(methylmetha- 
crylate) (PMMA) blocks of dimensions 200 mm X 100 mm X 5.5 mm (top block) 
and 290 mm X 28 mm X 30 mm (bottom block) in the x, y and z direction, respect- 
ively (see Fig. 1a). Material shear, Cs, and longitudinal, C,, wave speeds were obtained 
by measuring the time of flight of ultrasonic pulses, yielding Cs = 1,345 + 10ms* 
and C, = 2,700 + 10ms_'. These provide Cy = 1,255 + 10ms_', where Cp is the 
Rayleigh wave speed. The values yield dynamic values for the Poisson ratio of y ~ 1/3 
and Young’s modulus of E = 5.65 GPa. Note the value of E is significantly different 
from the static value of E = 3 GPa. This difference is due to the viscoelastic behaviour 
of PMMA*". The contact face of the top block was diamond-machined to optical 
flatness. The bottom block’s contact face was carefully polished first by 15-jum and 
later by smoother 9-1m diamond standard lapping film, yielding a 3-um r.m.s. sur- 
face roughness. 

Loading system. While the loading system was slightly modified and fitted for the 
purposes of the current research, the experimental system is described in detail 
elsewhere’*°. The top block is clamped at its top edge, while the bottom block is 
mounted on a low-friction linear translational stage (Fig. 1a). External normal load, 
Fy, was fixed at the beginning of an experiment by pressing the blocks together in 
the y direction. Subsequently, external shear force, Fs, was quasi-statically (40 Ns) 
applied (starting from Fs = 0) to the bottom block in the negative x direction via 
a load cell of stiffness 10” Nm '. Generally, this procedure results in an induced 
torque and consequently, strong gradients of normal stresses at the interface (see 
figure 1 in refs 19, 30). To partially compensate for this effect we varied the loading 
conditions by slightly tilting the top block before applying Fy. 

Real contact area measurements. Changes in the real contact area along the entire 
interface were measured by an optical method based on total internal reflection. 
Basic principles are presented in detail elsewhere’. A sheet of light, incident on 
the frictional interface at an angle well beyond the critical angle for total internal 
reflection, is reflected everywhere except at contact points (Extended Data Fig. 1a). 
This yields an instantaneous transmitted light intensity proportional to A over 
the entire (x X z) 200 X 5.5 mm interface. The transmitted light is continuously 
imaged (at a spatial resolution of 1,280 X 8 pixels) at 580,000 frames per second 
using a high speed camera, Phantom v710. Data acquisition is continuous. The 
data are temporarily stored in a circular buffer large enough to acquire 7 ms of data 
both before and after each event. Storage of the data for each dynamic slip event is 
triggered by a sensitive acoustic sensor that is coupled to the top plate of the system. 

The frictional interface is quasi-1D, as its width (z direction), 5.5 mm, is much 
smaller than any other dimensions of the block. The simultaneous measurements 
of A(x,t) along the entire 1D interface is obtained by integration of the acquired 
images over the 8 pixels in the z direction. 

We use a high power LED (CBT-90) as our source of light. This is in contrast 

to refs 32 and 33, where a laser sheet was used. One main advantage of a non- 
coherent light source is the absence of interference patterns and, therefore, a higher 
signal to noise ratio. Moreover, PMMA is a photo-elastic material, and non- 
polarized light neutralizes photoelastic effects that could affect the intensity of 
the transmitted light. 
Strain measurements. We use miniature Vishay 015RJ rosette strain gauges for 
local measurements of a 2D strain tensor. 19 such strain gages are mounted along 
and ~3.5 mm above the frictional interface, on one side of the upper block (Fig. 1a). 
All strain signals (57 channels) are amplified (gain ~400, ~1 MHz bandwidth) and 
simultaneously acquired to 14 bit accuracy by an ACQ132 digitizer (D-tAcq Solu- 
tions Ltd) which simultaneously acquires all strain signals at a 1 MHz rate. Acqui- 
sition is synchronized with the contact area measurements. 

Dimensions of the strain gauges are presented in Extended Data Fig. 1b. The 
active size of each strain component is 0.34 X 0.38mm. The measured strain 
components, &), &2, €3, are later converted to Exyp Exx0 Eyy by means of simple linear 
transformations: 


Eyy =€) 


Exy = 1/2(€3 —€2) (3) 


Exy = €3 + &2 — €] 


Direct displacement, u,, measurements:. Two Philtec D20 optical displacement 
sensors were used to monitor the local motion in the x direction, u,, at different 
locations along the top block (Extended Data Fig. 1c, for clarity only one is shown). 
The original operation method of the Philtec sensor is based on measuring the 
reflectance of the transmitted light from a mirror target moving in a direction 
parallel to the light beam. As we need to measure displacement, u,, perpendicular 
to the sensor’s axis (Extended Data Fig. 1c, right), we used the sensor in a different 
way—making use of the fact that the sensor is, in essence, a reflectance sensor. 
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Thus, if a larger area of the light spot is incident on the reflective target, a larger 
signal is measured. As the target moves (in the x direction), different fractions of 
the spot area are reflected and the measured signal varies with u, (Extended Data 
Fig. 1b, left). This signal is calibrated, and a ~0.1 jum resolution at an acquisition 
rate of 1 MHz is obtained. The sub-millimetre size of these sensors allows us to 
therefore obtain local measurements at extremely high precision and rapid rates. 
Local material velocity, i, measurements:. Local material velocity, i1,, is deter- 
mined by the numerical time derivative of the displacement, u,, measured by the 
displacement sensor. Measurements of u,, however, require delicate mounting 
and calibration of the displacement sensor. It is, therefore, extremely difficult to 
mount several sensors along the interface. In what follows, a more easily accessible 
technique is presented. 

If the rupture propagation velocity, Cs varies adiabatically, we can assume 
ux(x,t) = u(x — Crt). As &xy = =, we can obtain the local material velocity via 
~ = — Cre, (for convenience, in the text we denote os by i,). Therefore, the 
measured ¢,,. combined with measurement of C;can be used to measure the mater- 
ial velocities at each strain gauge site. As we measure &,, in 19 locations along the 
interface and C; is extracted from the instantaneous location of the rupture tip 
(measurements of A(x,t)), this technique enables a relatively dense measurement 
of it, along the interface without any additional preparations. 

This technique was checked explicitly in several experiments, when displace- 
ment sensors were mounted slightly below one of the strain gauges and simulta- 
neously measured. Extended Data Fig. 2 shows typical experiments where this 
technique is compared with direct displacement measurements. As the figure 
shows, the agreement between the two techniques is often within our measure- 
ment error. 

Converting temporal to spatial measurements. The distance between adjacent 
strain gauges (~ 1 cm) is too large to provide good resolution of the spatial variation 
of the rapidly propagating fronts (Fig. 2b). We implement a high temporal resolu- 
tion (1 ps) to overcome this difficulty. We assume that locally, the rupture is quasi- 
steady, &;;(x,t) = ¢j(x — C)). Therefore, temporal measurements of €4(x = xo,t) (Xo is 
the position of the strain gauge) and C, together with the transformation t— —tC, 
allow us to dramatically increase our spatial resolution. Extended Data Fig. 2 demon- 
strates typical examples where this quasi-steady assumption was verified. We com- 
pensated for slow spatial variations of C,(x) by using é;(x,) = («x — [Cadt) instead of 
Ey(ot) = &,(x — Cp), where Ct) was obtained from A(x,f) measurements. 
Comparing LEFM to measured strains. Strain gauge finite dimensions. Both the 
finite dimensions (Extended Data Fig. 1b) of the strain gauges and the temporal 
resolution of the acquisition system should be taken into account for proper 
interpretation of the measured strain signals of rapidly propagating ruptures. In 
order to accurately compare theory and measurements we convert the strain 
component é), & 5 €,, predicted by LEFM, to ¢), é, €3 (for orientation of the 
measured strain components, see Extended Data Fig. 1b) by the inverse trans- 
formation of equation (3). After &, &, €; were spatially averaged in accordance 
with both the geometry of the strain gauge components and the temporal differ- 
ences due to the finite distances between the components, theoretical values cor- 
responding to the measured ¢,; were obtained. These were the basis of the 
comparisons performed in Figs 2 and 3. 

Boundary conditions. The universal LEFM solution is given’ for both plane 
stress (o,, = 0) and plane strain (¢,, = 0) boundary conditions. Plane stress bound- 
ary conditions are considered to be relevant when a thin plate with traction-free 
faces is considered. This assumption should be violated close to the frictional 
interface where the material is pinned—and therefore not allowed to expand freely 
in z direction. Moreover, when short wavelength (compared to the width of the 
sample) signals are considered, plane strain boundary conditions are generally 
more appropriate. This might be relevant for rapid ruptures. 

In our experiments we expect that some mixture of the two boundary conditions 
might take place. Extended Data Fig. 3 demonstrates that while both solutions have 
similar characteristic features some differences are apparent; measurements of &,.. 
are slightly better described by the plane stress solution, whereas measurements of 
éyy are significantly better described by plane strain. 

These considerations, however, are beyond the scope of the current work and in 
the main text we compare our measurements with the solution dictated by plane 
strain boundary conditions alone. 

LEFM coupled to simple cohesive zone model. The universal square-root sin- 
gularity, predicted by LEFM, must be regularized by dissipative processes taking 
place in the vicinity of the crack tip. Therefore, it should be interpreted as ‘inter- 
mediate asymptotics’; the universal functional form is expected to describe the 
elastic fields at distances far from the interface weakening region, X,, but still small 
compared to other dimensions of the system’. This assumption is known as ‘small 
scale yielding’. For high Cs X; is sufficiently small to have a clear separation of 
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scales. In the slow rupture regime the measurements were performed at a height 
h = 3.5 mm above the interface, which is a scale that is comparable to X.(~2.5 mm 
at low velocities). The measured strains, however, are surprisingly well-described 
by the theory. This observation is perfectly compatible with simple cohesive zone 
models'’”*, as we demonstrate in Extended Data Fig. 4. 

The effect of higher order terms. The complete LEFM stress field consists of a 
singular, r~ 1/2 term together with additional (non-universal) terms of the form 
r'/2, 73/2, and so on, which vanish at the tip but may be significant at distances 
away from the crack tip. 

Following the general solution procedure described in refs 10 or 22 we can 
derive the full angular dependence of these contributions to the stress field: 


ni 5 n0g aw) a nd, 
Oxy (150) = Cun) 4014054 COS ae —(1+02) ys cos| — 


n2a.(1+0) (2 (nOg Zz (no 
2 S S Je nei s 
C,,(2mr) NG) (:isin( 5 ) ys sin( i )) 


where the functions %;,d,),,/q,0s,0a,D are defined in ref. 10, n = —1,1,3,5...(odd 
terms are considered here) and C,, are numerical constants. For 1m = —1 we recon- 
struct the universal form with C-, = Ky. Generally, for n > —1 the constants C,, 
are not universal and cannot be easily determined. 

We now demonstrate that these terms will not resolve the discrepancy, observed 
for high C;, in Az, between the singular LEFM term and the measured signal, as 
presented in Fig. 3b, bottom. For simplicity, we consider the contribution of the 
terms n = 1,3. We attempt to resolve the discrepancies in A¢,, in the range 0 < x — 
Xtip by varying C, and C; under the constraint that we can still reasonably describe 
Aé,,.and Aéyy. As Extended Data Fig. 5 demonstrates, these terms are insufficient 
to resolve the Aé,, discrepancies in the range 0 <x — xp < 25mm while still de- 
manding consistency with the measured data for |x — xy)| > 25 mm. Assuming that 
the next terms (n > 3) will dominate at even larger distance, we conclude that, within 
this framework, the observed discrepancy is not resolved. 

Estimating cohesive zone properties. Our measured values of ‘and X. can provide 
rough estimates of cohesive zone properties. Let us assume a slip weakening model'*'” 
in which the stress singularity predicted by LEFM is regularized by a peak shear stress, 
Tp, that degrades to a residual strength, t,, over a slip distance, d.. For simplicity, the 
weakening within the cohesive zone is assumed to be linear. It is then found that for 


9 K; 
slow ruptures, X? = a x poe where Ky is calculated from I” = [(1 — v’)/E] Ky? 
Tp —Tr 


Oyy(1,0) = 


(ref. 11). Using the values J’ and X? from the text and the dynamic values of the 
Poisson ratio and Young’s modulus, we find 1, — 1, ~ 1.6 + 0.4 MPa. For the 
conditions applied in the current experiments, 1, ~ 3.7 + 0.4 MPa. This leads to 
an estimation of tp ~ 5.3 + 0.8 MPa. The linear slip weakening model also esti- 
mates that "= 1/2(t, — 1,)d.. This yields d. ~ 1.4 jum, which is consistent with 
the asperity size of our rough interface. 

Using this value of d., we can estimate, using simple scaling arguments””™’, the 
maximal slip velocity Vinax ~ Crd/X_. Thus, the contraction of X, in Fig. 4c indicates 
strong growth of V,ax, Which was shown in recent numerical simulations**. Using 
ity = — Créxx we obtain, for example in Fig. 2b at C; = 0.96Cp, a maximal value of 
it, ~0.3ms | measured 3.5 mm above the interface. At the interface, for the same 
C,, the model then predicts 7 times larger velocity (u, ~2m s }). Following this 
procedure, our maximal measured velocity of ~1 ms‘ (Fig. 3a) corresponds to at 
least 1. 10ms ? at the interface. 

This discussion suggests that caution should be used when interpreting mea- 

surements at a finite distance from the interface as representative of processes at 
the interface. 
Considering the value of /? For our experimental conditions, A is approximately 
0.5% of the nominal contact area‘. For different applied normal stresses this value 
will change. In addition, A is typically reduced (Fig. 4a) by about 20% during the 
rupture process. As a result, the value of 7 ~1.1Jm * results from a reduction 
of approximately 0.1% of the nominal fracture area. I’ therefore corresponds to a 
bulk fracture energy (a 100% reduction of the nominal fracture energy) of about 
Tuk 1,100 J m*. This value is close to the measured value of gu for mode I 
fracture experiments in PMMA” and had been previously estimated for frictional 
dissipation using a thermal argument”. 
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Extended Data Figure 1 | Experimental techniques. a, A method based on 
total internal reflection measures instantaneous changes in the real contact 
area, A(x,t), along the entire interface. A sheet of light incident on the frictional 
interface is totally reflected everywhere except at contact points. Top block, 
pink; bottom block, blue. b, Geometry and dimensions (in mm) of a single 
rosette strain gauge. The black rectangles represent the active area of the 
measuring components, &), 62 and &3. Yellow arrows represent the direction of 
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the measured strains. c, By placing a reflective displacement sensor at the edge 
of a reflecting film (left), the change in the reflected signal is a monotonic 
function of the displacement of the film. The reflecting area was calibrated 

to obtain high resolution measurements of the motion of the strip edge in 
the x direction (right). a-c, Measurements described above were acquired 
continuously. A sensitive acoustic sensor, mounted on the top block, triggered 
storage of all data bracketing every slip event. 
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Extended Data Figure 2 | Comparing different techniques for measuring 
uy. In several experiments, direct displacement measurements were performed 
slightly below the strain gauge. The numerical time derivative of the 
corresponding signal is presented in blue. We reconstruct u, by an alternative 
method (green) that is based on measuring C, Aé,, and the assumptions 
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presented in the text. a, b, Comparisons of i, by using both methods for two 
typical examples at C; = 0.3CR (a) and C; = 0.9Cg (b) show good agreement 
between the measurements. x axes are the times relative to the time, hip» when 
the rupture fronts passed each measurement location. 
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Extended Data Figure 3 | LEFM solution for plane strain and plane stress _b. The solution for plane strain boundary conditions is shown in black. The 
boundary conditions. a, b, Strain measurements of slow (a) and rapid solution for plane stress boundary conditions is shown in yellow. We see that 
(b) ruptures as presented in Fig. 2b. The top (bottom) panels are the strain plane strain conditions, as might be enforced by frictional pinning at the 
components é,, (éyy) relative to their initial values immediately priortorupture. _ interface, provide a better fit to the data. 

The data presented are coloured as noted in the keys in the upper panel ofa and 
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Extended Data Figure 4 | Comparing LEFM to linear cohesive zone model 
predictions. Strain measurements of a slow rupture as presented in Fig. 2a. 
Top, the strain component é,,, relative to the residual strain after the rupture 
passage. The centre (bottom) panels are the strain components ¢,, (¢,,) relative 
to their initial values immediately prior to rupture. The data presented are 
coloured as noted in the key in the upper panel. Black solid line, the universal 
LEFM solution at 3.5 mm above the interface. Yellow dashed line, prediction of 
LEEM coupled to a linear slip weakening model’ for X, = 2.5 mm and using 
the same value of Jas used in the LEFM solution 3.5 mm above the interface. In 
the slow rupture regime, for h/X, ~ 1 (his distance of the strain gauge above the 
interface) linear slip weakening model is indistinguishable from the singular 
LEFM solution. 
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Extended Data Figure 5 | Contributions of higher-order terms to LEFM. 
Strain measurements of a rapid rupture as presented in Fig. 2b. Top, the strain 
component é,,, relative to the residual strain after the rupture passage. The 
centre (bottom) panels are the strain components é,., (yy) relative to their initial 
values immediately prior to rupture. The data presented are for three different 
measurements in which C; = 0.96Cg. The key in the upper panel denotes the 
terms that were considered for each coloured line. No value of the r”* (C,) and 
7’? (C3) coefficients could both resolve the discrepancy in Aé,, and reasonably 
fit Az,, and Ag... 
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Tropical forcing of the recent rapid Arctic warming 
in northeastern Canada and Greenland 


Qinghua Ding’, John M. Wallace”, David S. Battisti’, Eric J. Steig', Ailie J. E. Gallant?, Hyung-Jin Kim* & Lei Geng? 


Rapid Arctic warming and sea-ice reduction in the Arctic Ocean are 
widely attributed to anthropogenic climate change’ *. The Arctic 
warming exceeds the global average warming because of feedbacks 
that include sea-ice reduction*’ and other dynamical and radiative 
feedbacks*"*. We find that the most prominent annual mean surface 
and tropospheric warming in the Arctic since 1979 has occurred in 
northeastern Canada and Greenland. In this region, much of the 
year-to-year temperature variability is associated with the leading 
mode of large-scale circulation variability in the North Atlantic, 
namely, the North Atlantic Oscillation'’*’’. Here we show that the 
recent warming in this region is strongly associated with a negative 
trend in the North Atlantic Oscillation, which is a response to anom- 
alous Rossby wave-train activity originating in the tropical Pacific. 
Atmospheric model experiments forced by prescribed tropical sea sur- 
face temperatures simulate the observed circulation changes and asso- 
ciated tropospheric and surface warming over northeastern Canada 
and Greenland. Experiments from the Coupled Model Intercompar- 
ison Project Phase 5 (ref. 16) models with prescribed anthropogenic 
forcing show no similar circulation changes related to the North 
Atlantic Oscillation or associated tropospheric warming. This sug- 
gests that a substantial portion of recent warming in the northeast- 
ern Canada and Greenland sector of the Arctic arises from unforced 
natural variability. 

The recent Fifth Assessment Report of the Intergovernmental Panel on 
Climate Change’ anticipates the continuing retreat of sea ice and warm- 
ing accompanying future anthropogenic emissions of greenhouse gases 
and aerosols. However, natural variability, such as that associated with 
the Atlantic Multi-decadal Oscillation’, has been suggested to be an impor- 
tant driver of climate variations in the Arctic region and responsible for 
a portion of the recent warming trend. Some recent results also indicate 
that sea surface temperature (SST) changes outside the Arctic have played 
a role in forcing the recent tropospheric warming in the Arctic’’. 

In this study, we use observational analyses and modelling to explore 
the relative contributions of anthropogenic forcing and natural vari- 
ability to recent warming trends in the Arctic. We identify a specific 
tropical SST pattern that has contributed significantly to recent Arctic 
warming by changing the circulation over northern high latitudes. 

We use post-1979 observations only, because the analyses of geo- 
potential height (which approximates the actual height of a pressure 
surface above mean sea level) and other variables over the Northern Hemi- 
sphere polar region are more reliable during the modern satellite era’®. 
The most reliable reanalysis data sets (ERA-interim’? and MERRA”’) and 
station data”! agree that the most pronounced annual mean surface 
temperature warming in the Arctic since 1979 has occurred over north- 
eastern Canada, Greenland and the north of Siberia (Fig. 1a and Extended 
Data Fig. 1). Surface temperatures along the Siberian coast are highly 
negatively correlated (r ~ —0.7 to —0.9) with in situ sea-ice variability 
where the observed surface warming is greatest and can be related directly 
to the significant sea-ice reduction. The troposphere has also experienced 
an annual averaged warming in the Arctic. Tropospheric warming is most 


pronounced over northeastern Canada and Greenland, where it extends 
through the depth of the troposphere and has occurred in all seasons, 
with the largest magnitude and extent during boreal winter (December- 
February) (Fig. 1b, Extended Data Figs 2 and 3). The magnitude of the 
surface and tropospheric warming in the northeastern Canadian- 
Greenland sector of the Arctic is nearly twice as large as the Arctic- 
mean warming. 
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Figure 1 | Observed trend pattern of annual mean field for 1979-2012. 
Linear trend (per decade) of annual mean surface temperature (a), 300-850 hPa 
temperature (b), 200-hPa geopotential height (Z200; c) and the non-zonal 
component of 200-hPa geopotential height (d). In a, surface temperature is 
shown over land or ice; SST is shown over ocean. In d, purple vectors 

(units: 10°Pam?s ~, vectors less than 10° Pam?s ” are omitted) denote the 
wave activity flux associated with the eddy Z200 trend pattern. Grid points with 
trends that are statistically significant at the 99% confidence level are denoted 
by small crosses. The box in c indicates the domain over which data are 
averaged in Extended Data Fig. 6. EQ, Equator. 
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The spatially varying character of warming trends over the Arctic 
region suggests a role for atmospheric circulation changes. Annual mean 
200-hPa geopotential heights (Z200) in the upper troposphere have 
increased almost everywhere in the Northern Hemisphere polar region 
since 1979, with the greatest increases occurring over northeastern Canada 
and Greenland, coinciding with the greatest surface and tropospheric 
warming (Fig. 1c and Extended Data Fig. 4). Consistent with the local 
surface temperature trend, the positive trend in Z200 over northeast- 
ern Canada and Greenland has occurred in all seasons, and is largest 
during winter (Extended Data Fig. 5). The correlation (r) between 34- 
year annual mean Z200 and surface temperature in this region is 0.9, 
and it is 0.8 for the detrended component (Extended Data Fig. 6a). Both 
the Z200 and surface temperature time series exhibit decadal-scale vari- 
ability superimposed on the long-term trend. In all seasons, the seasonal 
and decadal anomalies in Z200 are highly correlated with the surface 
temperature anomalies (Extended Data Fig. 6b-e). 

Because the upper-level atmospheric circulation is relatively insensitive 
to anomalous surface warming in the polar regions”, it seems unlikely 
that the decadal-scale variations in Z200 in the 34-year record could be 
forced by local variations in surface temperature. Theory and a myriad 
of modelling studies suggest that causality operates in the opposite direc- 
tion: that is, that the changes in surface and tropospheric temperature 
are an adiabatic response to the changes in the upper-level circulation, 
which may have been remotely forced. 

The circulation trend is marked by rising geopotential heights over 
Greenland and near-steady heights along ~50° N over the North Atlantic 
(Fig. 1c). The pattern can be characterized as a uniform increase in geo- 
potential heights across the Arctic, combined with the negative polarity 
of the North Atlantic Oscillation (NAO). The NAO is the leading mode 
of circulation variability and the main driver of temperature variability 
and extreme winter weather events in the North Atlantic'*!. A year- 
round NAO index, given by the leading principal component of low- 
level geopotential height anomalies in the North Atlantic’, exhibits a 
strong (r = 0.7) correlation with surface temperature and Z200 anom- 
alies in northeastern Canada and Greenland (Extended Data Fig. 6a). 
The past three decades have been marked by a negative trend in the 
NAO index”’. 

Further insight into the causes of the remarkable warming trend over 
Greenland and northeastern Canada is gained by examining the trends 
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Figure 2 | Coupled patterns between annual mean tropical SST and 
Northern Hemisphere circulation for 1979-2012. a—d, Results of maximum 
covariance analysis for annual mean 1979-2012 Northern Hemisphere 
(0-88.5° N) 200-hPa geopotential height (Z200) and tropical (20° S to 20° N) 
SST. Shown in a are the patterns of Z200 (contours, interval 10 m) and tropical 
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in the eddy component of Z200, obtained by removing the zonal average 
at each latitude from the trends (Fig. 1c). The results (Fig. 1d) strongly 
suggest that the 200-hPa geopotential height increases over northeastern 
Canada and Greenland are associated with a Rossby wave train that 
originates in the tropical Pacific. The corresponding flux of wave activity 
exhibits a distinctive arc-shaped trajectory, extending from the central 
tropical Pacific towards the Arctic and returning to the tropics via the 
Atlantic sector. Coincident with the pattern of trends in the warming 
and anomalous circulation, SSTs (and rainfall) have increased signifi- 
cantly at the 99% confidence level throughout the tropics except for the 
central and eastern Pacific where they have remained nearly constant 
or even decreased slightly, as shown in Fig. 1a (Extended Data Fig. 7a). 
Motivated by two earlier studies**”’, we hypothesize that this pattern of 
SST change in the tropics has played a key role in forcing the extra- 
tropical wave train that has both warmed northeastern Canada and 
Greenland and induced the negative trend in the NAO in recent decades. 

Our hypothesis that the trend towards higher geopotential heights 
over northeastern Canada and Greenland has occurred in response to 
trends in tropical SST is supported by maximum covariance analysis 
(MCA)”*. Figure 2 shows the leading modes of co-variability between 
annual mean Northern Hemisphere Z200, and tropical SST for the 
period 1979-2012. The leading MCA mode explains 68% of the squared 
covariance between the hemispheric Z200 and tropical SST fields, and 
captures the typical El Nifio/Southern Oscillation (ENSO) signature in 
SST and the related atmospheric teleconnection pattern over the Pacific 
and North American sectors, with their familiar sequence of year-to- 
year variability. The patterns associated with the second mode resemble 
the trend in tropical SST and the wave-train pattern of the Z200 trend 
(Fig. 1a, c). The second MCA mode explains 28% of the squared covari- 
ance. The time series associated with this mode are strongly correlated 
(r = 0.91) and both exhibit a pronounced upward trend over the 34- 
year record. The time series of the SST pattern in mode 2 is highly cor- 
related with the surface temperature in northeastern Canada and Green- 
land and with the year-round NAO index (Extended Data Fig. 8). This 
mode is not merely a reflection of the trends in the two fields, because 
similar MCA results are obtained when the data are detrended before 
conducting the MCA calculation (Extended Data Fig. 9). Thus, the 
wave train that links the positive temperature trends in northeastern 
Canada and Greenland to the tropical SST anomalies characterizes both 
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SST (shading; colour scale) that accompany the first mode. c, Time series of 
the Z200 (red) and SST (blue) patterns shown in a. Panels b and d are the same 
as panels a and ¢, but for the second mode. Amplitudes in a and b are scaled by 
one standard deviation of the corresponding time series in ¢ and d. 
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interannual and interdecadal timescales. Compared to the circulation 
anomalies associated with typical ENSO, the circulation anomalies 
associated with the mode 2 pattern of SST anomalies exhibits a wave 
train that extends farther poleward and includes circulation anomalies 
over the North Atlantic that are very similar to those associated with the 
NAO. Thus, the recent 30-year trends in tropical SST—distinct from 
ENSO—may be partially responsible for the negative trend in the con- 
ventional NAO index. 

To further examine the relationship between tropical SST forcing and 
circulation variability in northeastern Canada and Greenland, we use an 
atmospheric general circulation model (ECHAM 4.6 (ref. 27)) to perform 
an ensemble of integrations, each of which is forced by the observed 
tropical SST from 1979-2012; in the extratropics, the atmosphere is 
coupled to a slab ocean with a thermodynamic sea-ice component. Ten 
experiments were run, each with different initial atmospheric condi- 
tions, and averaging over the ten experiments is used to filter out in- 
trinsic atmospheric variability. The ensemble-averaged change in the 
high-latitude circulation is, by construction, due to the remote tropical 
forcing and to the local amplification associated with sea-ice change. The 
ensemble-averaged trends in surface temperature, tropospheric temper- 
ature and Z200 in the Northern Hemisphere are shown in Fig. 3; in the 
western hemisphere they closely resemble the observed trends shown 
in Figs la—c, and are consistent with the trend patterns associated with 
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Figure 3 | Simulated 1979-2012 trend patterns in annual mean fields in 
ECHAM. Linear trend (per decade) of annual mean surface temperature (a), 
300-850 hPa temperature (b), Northern Hemisphere 200-hPa geopotential 
height (Z200; c) and the non-zonal component of 200-hPa geopotential 
height from 34-year simulation of ECHAM run forced by observed SST 
(1979-2012) in the tropics (30° N to 30° S) (d). In the extratropics, the 
atmosphere is coupled to a slab ocean model. In d, purple vectors (units: 

2.5 X 10° Pam?s , vectors less than 2.5 X 10*Pam?s 7 are omitted) denote 
the wave activity flux associated with the eddy Z200 trend pattern. Grid points 
with trends that are statistically significant at the 99% confidence level are 
denoted by small crosses. 
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the mode 2 in the MCA analysis. Rising 200-hPa heights and wide- 
spread warming over northeastern Canada and Greenland are an 
integral part of the response to the imposed tropical SST anomalies. 

As in the observations, the simulated circulation anomaly over the 
North Atlantic is similar to that observed during the negative phase of 
the NAO. As in the observations, the warming trends and atmospheric 
circulation anomalies are largest in winter. The simulated annual mean 
Z200, surface and tropospheric temperature trends over northeastern 
Canada and Greenland are about 10 m per decade, 0.4 °C per decade 
and 0.3 °C per decade respectively, roughly half the magnitude of the 
values in the ERA-interim reanalysis'’’. We note that the model, forced 
by the observed tropical SST change, also reproduces the observed rain- 
fall trend in the tropical Pacific (Extended Data Fig. 7b). The simulated 
wave-train path over the North Pacific and North Atlantic differs in 
detail from that observed (compare Figs 3d and 1d), possibly owing to 
subtle differences between the real and simulated climatological mean 
zonal flow. Nonetheless, the model results strongly suggest that a Rossby 
wave train associated with the observed tropical SST trend is respon- 
sible for the observed negative trend of the NAO and accounts for both 
the regional surface and tropospheric warming in northeastern Canada 
and Greenland. 

Last, we explore the contribution of external forcing to the recent 
regional warming in northeastern Canada and Greenland. We take the 
ensemble-average response of the 40 climate models that were forced 
by historical forcing and archived in the CMIP5 database historical 
experiment’®. We note that in these experiments, “historical forcing” 
includes both anthropogenic and natural forcing. 

The trends in the ensemble-average surface and tropospheric tem- 
perature from the historical forcing simulations are shown in Fig. 4a 
and b. Comparing Fig. 1a to Fig. 4a, we see that the ensemble average of 
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Figure 4 | Simulated 1979-2004 trend patterns in annual mean fields in 
CMIP5. Linear trend (per decade) of surface temperature (a), 300-850 hPa 
temperature (b) and 200-hPa geopotential height (Z200; c) for the period 
1979-2004 due to anthropogenic and natural external forcing. Here we show 
the average of an ensemble of experiments using 40 climate models (Extended 
Data Table 1) that are archived in the CMIP5 database’® and used in the 
Fifth Assessment Report of the Intergovernmental Panel on Climate Change’. 
The trends shown in the plot are significant at above the 99% confidence level at 
all grid points. 
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the historical simulations using the CMIP5 models shows a general warm- 
ing throughout the globe—more so in the polar regions than in the 
tropics—with a maximum in the western Arctic Ocean due to reduced 
sea ice. The forced pattern of warming does not capture the cooling in 
the eastern tropical Pacific or the maximum warming over northeast- 
ern Canada and Greenland; in these regions, the CMIP5 models cap- 
ture only about 40-50% of the observed warming in the surface and 
troposphere. 

From a comparison of Figs 1c and 4c it is evident that the ensemble 
average of the historical simulations exhibits much larger Z200 rises in 
the tropics than observed, and slightly smaller increases in the polar 
region than observed. The most salient circulation feature in the obser- 
vations is the pronounced localized rise in upper-tropospheric geopo- 
tential height and surface and tropospheric temperature over northeastern 
Canada and Greenland (Fig. 1). These features are not reproduced in the 
ensemble average of the historical simulations using the CMIP5 models. 

The results from the ECHAM experiments forced by observed SST, 
together with the ensemble average of the historical simulations of the 
CMIP5 models, suggest that about half of the observed warming in nor- 
theastern Canada and Greenland is due to a uniform warming of the 
entire Arctic (directly attributable to anthropogenic forcing’); the other 
half is due to a wave train of circulation anomalies that extends from 
the central tropical Pacific to Canada, and which is forced by trends in 
the tropical Pacific SST. That the historically forced CMIP5 model average 
does not capture the observed tropical SST pattern further suggests that 
around half of the recent warming over Canada and Greenland is asso- 
ciated with natural variability that is intrinsic to the coupled atmosphere- 
ocean system. We note, however, that the observational records in the 
tropics are temporally and spatially incomplete, and that the simulated 
response of tropical SSTs to anthropogenic forcing is strongly model- 
dependent”®. Hence we cannot rule out the possibility that the pattern 
of the observed tropical Pacific SST trend may also be associated with 
external forcing. 


METHODS SUMMARY 


We used atmospheric circulation and temperature data from the 1979-2013 ERA- 
interim reanalysis'®, SST data from ERSSTv3 (ref. 29), and sea-ice data from 
HADISST”. We used output from 40 climate models archived in the CMIP5 database’® 
to estimate the climate response to external forcing in the historical record. MCA” 
captured the dominant coupled modes between the Northern Hemisphere Z200 
(0-88.5° N) change and tropical SST (20° N to 20° S). We used the ECHAM4.6 (ref. 27) 
atmospheric general circulation model to simulate the circulation and temper- 
ature response forced by the observed tropical (30° N to 30° S) SST trends; in the 
extratropics, the ECHAM4.6 is coupled to a slab ocean with a simple, thermodyn- 
amic-only sea ice model. We focus on the annual average in this study because the 
recent temperature and circulation changes over the region of interest are similar 
across all seasons. To construct an annual mean, we averaged the 12 months from 
June through to the end of the following May. Note that the year label refers to June 
(for example, the annual mean of 2012 is the average from June 2012 through to the 
end of May 2013). The results presented in this study are not sensitive to how the 
annual average is constructed. It should be noted that because the historical record 
of reliable data for the Northern Hemisphere polar region is short, the term ‘trend’ 
used in the main text refers to any low-frequency variability distinct from the inter- 
annual variability; it does not necessarily imply a long-term trend, but may simply 
reflect decadal variability. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 

Data. Atmospheric circulation and temperature data are from the 1979-2013 ERA- 
Interim reanalysis'’ and the Modern Era Retrospective- Analysis for Research and Ap- 
plications (MERRA)” reanalysis. SST data are from ERSSTV3 (ref. 29). Sea-ice data are 
from HADISST™. Rainfall data are from GPCP*". We also use land-only station surface 
and near-surface temperature data from University of Delaware” and NASA GISS Sur- 
face Temperature (GISTEMP)”. We used output from the historical simulations of 40 
climate models (Extended Data Table 1) that are archived in the CMIP5 (ref. 16) database. 
MCA. We use maximum covariance analysis (MCA”*) to capture the dominant 
coupled modes between the Northern Hemisphere Z200 (0-88.5° N) change and tro- 
pical SST (20°N to 20°S). MCA organizes the variance in two data sets into ortho- 
gonal patterns that maximize the covariance in the data. 

Wave flux analysis. Wave activity analysis* is used to reveal stationary Rossby 
wave energy propagation. The wave flux vector provides direct information on the 
flux of wave activity, which is parallel to the group velocity of quasi-stationary Rossby 
waves. This diagnostic tool is well suited for detection of propagating large-scale 
quasi-stationary Rossby waves. 
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Climate model. The general circulation model used to perform the experiments in 
this study is the ECHAM4.6 atmospheric general circulation model”, at horizontal 
resolution of T42 (~2.8° latitude X 2.8° longitude) with 19 vertical levels. We cou- 
pled the ECHAM4.6 to a slab ocean, whereby the ocean is represented as boxes of 
water of uniform specified depth (50 m). A simple, thermodynamic-only sea-ice 
model is included when the ocean temperature reaches the freezing point. The ocean 
temperature or sea-ice condition at each grid point is affected only by heat exchange 
across the air-sea interface; there is no direct communication between adjacent ocean 
grid points, nor is there any representation of the deep ocean. In addition, a cyclo- 
stationary climatological heat flux is added to the ocean temperature tendency equa- 
tion in order to maintain a seasonal cycle of ocean temperature and sea ice condition 
that is as close to that observed. 


31. Huffman, G. J., Adler, R. F., Bolvin, D. T. & Gu, G. Improving the global 
precipitation record: GPCP version 2.1. Geophys. Res. Lett. 36, L17808 
(2009). 

32. Plumb, R.A. On the three-dimensional propagation of stationary waves. J. Atmos. 
Sci. 42, 217-229 (1985). 
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Extended Data Figure 1 | Observed trend pattern of annual mean surface _c, University of Delaware and d, GISTEMP for the period of 1979-2012. The 
temperature for 1979-2012. Linear trend (per decade) ofannual mean surface _ grey regions in c and d indicate no data. 
and near-surface temperature for a, ERA-interim, b, MERRA reanalysis, 
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Extended Data Figure 2 | Observed trend pattern of seasonal mean surface to May (MAM), ¢, June to August (JJA) and d, September to November (SON) 
temperature for 1979-2012. Linear trend (per decade) of seasonal mean for the period 1979-2012. 
ERA- interim surface temperature for a, December to February (DJF), b, March 
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Extended Data Figure 3 | Observed trend pattern of seasonal mean seasonal mean ERA-interim 300-850 hPa temperature for a, DJF, b, MAM, 
300-850 hPa temperature for 1979-2012. Linear trend (per decade) of c, JJA and d, SON for the period 1979-2012. 
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Extended Data Figure 4 | Observed trend pattern of annual mean field for —_ height (Z500), and b, sea level pressure (SLP) of ERA-interim for the period 
1979-2012. Linear trend (per decade) of annual mean a, 500-hPa geopotential 1979-2012. Solid (dashed) contours indicate positive (negative) trends. 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


Z200 change (m) 4 


o nm Oo WM Oo mo 4 
Oo nun UN A a Ww I | 
—_ | T= 


= =| 

<x So 

= n 

re} ae) 
Zz Zz Zz oC Zz Zz Zz Co 
(=! oO oO [=3} oO Oo ad [== 
fon) io} ine) fom) ive} oD 


a DJF 


Extended Data Figure 5 | Observed trend pattern of seasonal mean 200-hPa__d, SON for the period 1979-2012. Solid (dashed) contours indicate positive 
geopotential height for 1979-2012. Linear trend (per decade) of seasonal (negative) trends. 
mean ERA-interim 200-hPa geopotential height for a, DJF, b, MAM, c, JJA and 
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Extended Data Figure 6 | Observed surface temperature and Z200 changein the right ordinate are °C (surface temperature) and unitless (NAO index). 
northeastern Canada and Greenland. a, Annual mean surface temperature,  b-e, Surface temperature and Z200 averaged over northeastern Canada and 
Surf-T(land), and Z200 averaged over northeastern Canada and Greenland Greenland (55° N-85° N, 280° N-340° N, denoted by a box in Fig. 1c) for each 
(55° N-85° N, 280° E-340° E, denoted by a box in Fig. 1c), and NOAA NAO season for the period 1979-2012. The correlation coefficient (r) of two curves is 
index" (sign is reversed for simplicity of comparison) for the period denoted in the bottom right corner. The first (second) number denotes the 
1979-2012. The units on the left ordinate are geopotential metres; the unitson _ correlation between the raw (detrended) time series. 
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Extended Data Figure 7 | Observed and simulated annual mean tropical simulation of ECHAM run forced by observed SST (1979-2012) in the tropics 
rainfall trend patterns for 1979-2012. Linear trend (mm per day per decade) (30° N to 30°S). In the extratropics and the polar region, the atmosphere is 
of annual mean a, observed GPCP”' rainfall, b, simulated rainfall from 34-year _ coupled to a slab ocean model with a thermodynamic sea-ice component. 
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Extended Data Figure 8 | MCA2 SST related annual mean land surface land surface temperature in ERA-interim during 1979-2012. b, As a but using 
temperature. a, Correlation (colour scale shows r) between the time series of _ the detrended surface air temperature data and the detrended time series of 
the MCA2 pattern of annual mean SST (blue curve in Fig. 2d) andannualmean mode 2 SST. 
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Extended Data Figure 9 | Coupled patterns between detrended annual the first mode in these two data sets. Panel c displays the time series of the Z200 
mean tropical SST and Northern Hemisphere circulation for 1979-2012. (red) and SST (blue) patterns shown in a. Panels b and d are the same as ina 
Figure shows MCA results for detrended annual mean 1979-2012 Northern and c but for the second mode. Amplitudes in a and b are scaled by one 
Hemisphere (0-88.5° N) 200-hPa geopotential heights (Z200) and tropical standard deviation of the corresponding time series in c and d; the latter have a 
(20° S to 20° N) sea surface temperature (SST). Shown in a are the patterns of standard deviation of one. 

Z200 (contour interval 10 metres) and tropical SST (shading) that accompany 
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Extended Data Table 1 | 40 climate models in the CMIP5 historical experiment 


CMIP5 model designation nx ny 
1. bcc-csm1-1 128 64 
2. bcc-csm1-1-m 320 160 
3. CCSM4 288 192 
4. CESM1-BGC 288 192 
5. CESM1-CAM5 288 192 
6. CESM1-FASTCHEM 288 192 
7. CESM1-WACCM 144 96 
8. CMCC-CESM 96 48 
9. CMCC-CM 480 240 
10. CMCC-CMS 192 96 
11. CNRM-CM5 256 128 
12. CSIRO-Mk3-6-0 192 96 
13. CanESM2 128 64 
14. F@OALS-g2 128 60 
15. FGOALS-s2 128 108 
16. FIO-ESM 128 64 
17. GFDL-CM2p1 144 90 
18. GFDL-CM3 144 90 
19. GFDL-ESM2G 144 90 
20. GFDL-ESM2M 144 90 
21. GISS-E2-H 144 89 
22. GISS-E2-H-CC 144 89 
23. GISS-E2-R 144 89 
24. GISS-E2-R-CC 144 89 
25. HadCM3 96 73 
26. HadGEM2-AO 192 144 
27. inmcm4 180 120 
28. IPSL-CM5A-LR 96 96 
29. IPSL-CM5A-MR 144 143 
30. IPSL-CM5B-LR 96 96 
31. MIROC-ESM 128 64 
32. MIROC-ESM-CHEM 128 64 
33. MIROC4h 640 320 
34. MIROC5 256 128 
35. MPI-ESM-LR 192 96 
36. MPI-ESM-MR 192 96 
37. MPI-ESM-P 192 96 
38. MRI-CGCM3 320 160 
39. NorESM1-M 144 96 
40. NorESM1-ME 144 96 
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List of 40 CMIP5 CGCMs used in Fig. 4 to examine the forced response of the climate system to anthropogenic and natural external forcing, along with the number of atmospheric horizontal grids. 
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Predicting biodiversity change and averting collapse 


in agricultural landscapes 


Chase D. Mendenhall'?, Daniel S. Karp'*"*, Christoph F. J. Meyer™®, Elizabeth A. Hadly” & Gretchen C. Daily*”*? 


The equilibrium theory of island biogeography’ is the basis for esti- 
mating extinction rates’ and a pillar of conservation science**. The 
default strategy for conserving biodiversity is the designation of 
nature reserves, treated as islands in an inhospitable sea of human 
activity’. Despite the profound influence of islands on conservation 
theory and practice*”, their mainland analogues, forest fragments in 
human-dominated landscapes, consistently defy expected biodiver- 
sity patterns based on island biogeography theory* *. Countryside 
biogeography is an alternative framework, which recognizes that 
the fate of the world’s wildlife will be decided largely by the hospi- 
tality of agricultural or countryside ecosystems'”"*"’. Here we directly 
test these biogeographic theories by comparing a Neotropical coun- 
tryside ecosystem with a nearby island ecosystem, and show that 
each supports similar bat biodiversity in fundamentally different 
ways. The island ecosystem conforms to island biogeographic pre- 
dictions of bat species loss, in which the water matrix is not habitat. 
In contrast, the countryside ecosystem has high species richness and 
evenness across forest reserves and smaller forest fragments. Rela- 
tive to forest reserves and fragments, deforested countryside habitat 
supports a less species-rich, yet equally even, bat assemblage. More- 
over, the bat assemblage associated with deforested habitat is com- 
positionally novel because of predictable changes in abundances by 
many species using human-made habitat. Finally, we perform a global 
meta-analysis of bat biogeographic studies, spanning more than 700 
species. It generalizes our findings, showing that separate biogeo- 
graphic theories for countryside and island ecosystems are necessary. 
A theory of countryside biogeography is essential to conservation 
strategy in the agricultural ecosystems that comprise roughly half of 
the global land surface and are likely to increase even further™. 

The mark of humanity on the biosphere is now so widespread and 
profound that many consider Earth to have entered a new geological 
epoch, the Anthropocene. This new epoch has brought an explosion of 
countryside ecosystems—arenas where villages, farmland and minimally 
altered habitats weave together to form intricate, human-dominated 
ecosystems—where the fate of much of the planet’s wildlife is being 
determined'*”’. Driven by the unrelenting pace of biodiversity loss and 
the erosion of benefits that nature provides to society’, conservation 
science has entered a new phase of prolific innovation'’. Despite the 
broadly defined and rapidly adapting framework of conservation science”, 
many of the biological theories at its centre persist, relatively unchanged. 
For example, theories from simple island ecosystems are still used in ways 
that incorrectly estimate rates of species extinction” and distort pro- 
jections of ecological risk in human-dominated ecosystems*”’, further 
exhausting an environmental, apocalyptic narrative’. Confusion and 
controversy surrounding the biodiversity crisis are substantially related 
to the overextension of the theory of island biogeography to human- 
dominated ecosystems, exacerbating a rift between conservation theory 
and real-world practices**”*. 


More than 30 years after the establishment of experiments® designed to 
test the efficacy of island biogeographic theory in countryside ecosystems, 
we now know that wildlife often defies patterns predicted by island 
biogeography theory—emphasizing the critical role of human-made 
habitats in ecosystems'"'*"!°*!**, Despite its known shortcomings'*”’, 
this theory persists as the basis for estimating extinction rates and mak- 
ing policy recommendations*'”’. In response, a new, holistic view of 
biogeography is emerging for countryside ecosystems—one that incor- 
porates human-made habitats and their associated biodiversity'*7***. 

Here we predict that human-made habitats typical of many coun- 
tryside ecosystems will provide resources for many species to increase 
their abundances and mitigate the extirpation of populations and spe- 
cies. We propose three hypotheses to contrast countryside and island 
biogeographic theories empirically: first, that forest fragments in agri- 
cultural landscapes, in comparison with true islands, support more spe- 
cies and have lower rates of local extinction; second, that forest fragments 
surrounded by deforested habitat host more evenly distributed abundances 
of species than do true islands surrounded by water because human- 
made habitats support individuals to differing degrees, through part 
or all of their life cycles; and third, that the expansion by many forest 
species into human-made habitats in the countryside ecosystem is gen- 
erating novel assemblages of biodiversity, whose members and their 
abundances reflect species-specific responses to habitat change (Fig. 1). 

We test these hypotheses by using data on Neotropical bat biodiver- 
sity, a group acutely sensitive to tropical deforestation’. First, we con- 
trast bat biodiversity within a mosaic of forest fragments embedded in 
farmland with that on true islands within a large, human-made lake. 
Second, we perform a meta-analysis of 29 studies of bat species rich- 
ness across countryside and island ecosystems worldwide, to assess the 
generality of our findings. 

Specifically, we contrast bat biodiversity in a Costa Rican country- 
side dominated by coffee plantations and pastures with a Panamanian 
island ecosystem dominated by water (Fig. 2). The two ecosystems are 
ideal for testing biogeographic theories because of their approximately 
similar ages, their close geographic proximity, their evolutionarily homol- 
ogous bat diversity and their similarly sized historical biogeographic 
species pools. 

Bats were captured over a 2-4-year period in sites located in forest 
reserves, forest fragments and coffee plantations in the countryside eco- 
system and in mainland forests, mainland forest edges and true islands 
in the island ecosystem”® (Fig. 2; see Methods). We captured 9,283 indi- 
vidual bats (4,282 and 5,001 individuals in the countryside and island 
ecosystem, respectively) and 62 bat species (43 and 33 species in the 
countryside and island ecosystem, respectively; 14 species were shared 
between the ecosystems). We found striking differences between eco- 
systems in three dimensions of biodiversity corresponding to our three 
hypotheses: species richness, species evenness and abundance-based assem- 
blage composition (Fig. 3 and Extended Data Tables 1-3; see Methods). 


1Center for Conservation Biology, Stanford University, Stanford, California 94305, USA. Department of Biology, Stanford University, Stanford, California 94305, USA. *Department of Environmental 
Science, Policy & Management, University of California, Berkeley, California 94720, USA. “The Nature Conservancy, Berkeley, California 94705, USA. °Institute of Experimental Ecology, University of Ulm, 
89069 Ulm, Germany. °Centre for Environmental Biology, University of Lisbon, 1749-016 Lisbon, Portugal. Woods Institute for the Environment, Stanford University, Stanford, California 94305, USA. 
8Global Economic Dynamics and the Biosphere, Royal Swedish Academy of Sciences, Stockholm SE-104 05, Sweden. 2Stockholm Resilience Centre, University of Stockholm, Stockholm SE-106 91, 
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To test our first hypothesis, we compared estimated species richness 
in forest fragments with that in true islands (Fig. 3b). In agreement with 
the species—area relationships that include human-made habitats’*"’, the 
best-fit statistical models that explained bat species richness in forest 
reserves and fragments in the countryside ecosystem suggested either 
no change in species richness between habitats (mean species estimated 


Sampling sites and habitats 


A Reserve A Mainland =i Forest habitat 
@ Forest fragment Mainland edge Deforested habitat 
} Coffee plantation @ Island | Water 


Figure 2 | Bat sampling locations. A total of 36 locations were sampled for bat 
biodiversity: 19 across the countryside ecosystem located in southern Costa 
Rica (a) and 17 in the human-made island ecosystem of Gatun Lake, 
constituting a major part of the Panama Canal (b). The two ecosystems are 
separated by about 350 km and occupy the neighbouring tropical life zones of 
premontane wet and lowland moist forests. Both ecosystems were extensively 
sampled for bats by using ground-level mist nets over multiple years. In the 
countryside ecosystem, sampling was conducted in two different forest 
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Figure 1 | Hypothetical biodiversity changes in countryside and island 
ecosystems. a—c, Conceptual diagrams showing hypothesized patterns of 
biodiversity in a minimally altered ecosystem (a) and corresponding 
countryside (b) and island (c) ecosystems derived from hypothetical land 
conversion either to agriculture or a lake, respectively. d-f, Proposed patterns 
of biodiversity for minimally altered (d), countryside (e) and island 

(f) ecosystems, depicted by rank-abundance plots. In each rank-abundance 
plot, the most abundant species at each site is given a species rank of 1, the 
second most abundant is 2, and so on. Pin labels mark sites and habitats with 
letters in each ecosystem and are referenced in the rank—abundance plot. In the 
countryside and island ecosystems, letters represent sites located in the 
mainland or reserve interior (A), mainland or reserve edge (B), forest fragment 
or true island (C) and deforested habitat or water matrix (D). We speculate that 
species loss generally occurs after landscape alteration, but there is wide 
variation between countryside and island ecosystems in how biodiversity 
changes in terms of species loss, changes in abundances and the formation of 
novel assemblages of biodiversity among habitats, because of the resources 
afforded in human-made habitats in countryside ecosystems. 


per site = 25 (s.d. = 4); Extended Data Table 4) or little change (competing 
model with a difference in the corrected Akaike information criterion 
(AAIC,) = 0.220 yielded a mean species estimated per reserve site of 
28 (s.d. = 5) and a mean species estimated per forest fragment site of 
23 (s.d. = 3)). Fragment size and isolation distance were poor predictors 
of species richness among countryside forest reserves and fragments 
(Extended Data Table 3). 

Aligning with our first hypothesis and predictions of island biogeo- 
graphic theory, the estimated number of bat species in the island eco- 
system declined with isolation distance but increased with island size 
(Extended Data Table 4). The area of remaining tropical forest within 
1 km, a more ecologically relevant measure of habitat amount", was the 


? Island 
aS ecosystem 


Countryside “SS Panama 
ecosystem cain 


Gatun Lake, 
Panama Canal 


Islana| 


reserves, a variety of forest fragments, and coffee plantations. In the island 
ecosystem, sampling was conducted in the interior mainland, mainland edges 
and on a variety of islands. The island ecosystem was sampled from 2003 to 
2005 and the countryside ecosystem from 2009 to 2012 with similar sampling 
techniques and effort (see Methods). In the countryside ecosystem figure the 
sampling location inside La Amistad International Park is not shown but is 
located about 25 km northeast of the Las Cruces Biological Reserve of the 
Organization for Tropical Studies. 
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strongest predictor of species richness on islands (R? = 0.746, P< 0.001, 
n = 17; Fig. 3a and Extended Data Table 4). In support of our first hypoth- 
esis, the rate of species loss as forest area declined was substantially and 
significantly higher in the island ecosystem than in the countryside eco- 
system (Extended Data Table 2). 

To test our second hypothesis, we analysed Pielou’s evenness index 
(Fig. 3c). Forest reserves and fragments in the countryside ecosystem did 
not vary in evenness—probably because bat populations are supported 
beyond forest edges in the deforested habitat. In the island ecosystem, 
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Figure 3 | Countryside and island bat biodiversity patterns. Countryside 
and island ecosystems support bat biodiversity in fundamentally different ways. 
A total of 62 bat species were recorded: 43 were captured in the countryside 
ecosystem (left column) and 33 in the island ecosystem (right column); of these, 
14 bat species were shared. Shown are the countryside and island ecosystem 
rank—abundance distributions for each site (a) and patterns of bat species 
richness (b), species evenness (c) and the Assemblage Abundance Shift Index 
(d), which accounts for changes in species richness. Symbol shapes and colours 
correspond to those shown in Fig. 2. Lines depict best-fit relationships after 
model selection and optimization. Dotted lines correspond to coffee 
plantations in b and islands in d, where broad habitat type was the best 
predictor. Analyses in b and c were conducted at a site level (36 sites). Species 
richness was estimated to account for rare species using the Chaol method and 
species evenness is standardized between 0 and 1 using the Pielou method. 
Analysis in d was conducted at a net level (383 nets), because assemblage 
differences were observed within sites relative to fine-scale changes in forest 
cover at 150 m. See Methods for further details, and Extended Data Tables 1-4 
for model comparisons and regression coefficients. 


however, species abundances were increasingly uneven on more isolated 
islands (Fig. 3c and Extended Data Table 5). Uneven abundance dis- 
tributions of bats in the island ecosystem are probably explained by 
island biogeography’s faunal density compensation theory”. 

To test our third hypothesis, we examined bat biodiversity patterns 
in a variety of coffee plantations in the countryside ecosystem (Fig. 3). 
In coffee plantations we captured 1,508 individuals representing 26 of 
the 43 bat species (mean species estimated per coffee plantation site = 18 
(s.d. = 2); Fig. 3b and Extended Data Table 4), nine of which were observed 
roosting and/or breeding on farmland but not in pastures”* (Extended 
Data Table 6). Moreover, bat species in coffee plantations were evenly 
distributed in their abundances (Fig. 3c). Species varied considerably 
in their abundance responses to deforestation, but 14 of the 30 bat spe- 
cies with at least seven captures in the countryside ecosystem were more 
abundant outside the forest reserves. Further, no species showed com- 
plete dependence on a single habitat type (Extended Data Fig. 1). 

Finally, to evaluate whether a novel bat assemblage is forming in coffee 
plantations because of predictable, species-specific changes in abun- 
dances, we developed and compared an index that measured assemblage- 
level changes in bat abundances across habitats in both ecosystems. The 
Assemblage Abundance Shift Index is based on ordination analyses of 
bat abundances and how they shift collectively, relative to bat assem- 
blages in forest reserves in the countryside ecosystem and to mainland 
sites in the island ecosystem. The Assemblage Abundance Shift Index 
accounts for changes in species richness and detection bias (see Methods 
and Extended Data Fig. 2). Patterns of the Assemblage Abundance Shift 
Index revealed that the abundances of many bat species in the coun- 
tryside ecosystem underwent predictable transitions in tandem with 
declining forest cover, particularly at small spatial scales’’ (150 m) and 
especially in coffee plantations. In the island ecosystem, bat abundances 
differed significantly between assemblages on the mainland interior 
and edge sites in comparison with islands, probably because of density 
compensation”’ (Fig. 3d). 

Our meta-analysis found strong evidence that bat biodiversity pat- 
terns follow predictions of island biogeographic theory on true islands— 
both natural and human-made—but countryside ecosystems are more 
complex. A review of 206 peer-reviewed papers on bat biodiversity in 
island and countryside ecosystems yielded several key findings from the 
29 studies that met meta-analysis search criteria (Fig. 4a; see Methods). 
First, we confirmed that the equilibrium theory of island biogeography 
accurately forecasted the change in bat species richness with island size 
on true islands (Fig. 4b), and bats on islands followed typical species— 
area relationships (Fig. 4c). Second, we found that declines in bat spe- 
cies richness expected from island biogeography were almost never 
realized in countryside forest fragments (Fig. 4b). Third, we found a 
variety of bat species richness responses in open habitats of country- 
side ecosystems, including relatively high richness in some agricul- 
tural systems'*** (Fig. 4d). Finally, we found that 10 of the 12 studies 
that compared bat assemblages between minimally altered forest, forest 
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Figure 4 | Bat species richness responses in countryside and island 
ecosystems worldwide. Globally, countryside ecosystems and island 
ecosystems support biodiversity in fundamentally different ways, necessitating 
separate biogeographic theories for prediction and management. a, Map 
identifying the locations of 52 bat species richness comparisons in different 
countryside and island habitats gathered from 29 studies. In all, the studies 
surveyed more than 700 bat species, representing more than 60% of all bat and 
10% of all mammal species globally. b-d, Island bat species richness always 
declined relative to the mainland or to larger islands in island ecosystems 

(b). For example, bat species richness on islands in the Caribbean and South 
Pacific yielded a typical species—area relationship (c; S = cA‘, c = 0.043, 

z= 0.269, where S is the number of species, A is the habitat area, c is the 


fragments and/or human-made habitats reported a significant assem- 
blage shift in bat biodiversity (see Supplementary Data). 

In view of humanity’s incomplete knowledge of nature, the designa- 
tion of protected areas is, and will continue to be, an essential strategy 
for conserving biodiversity’. The widespread application of island bio- 
geographic theory to countryside—less so in Europe, but especially in 
the Tropics—is, however, distorting our understanding and conserva- 
tion strategies in agriculture, the enterprise on which the future of bio- 
diversity hinges most critically. The hospitality of the world’s agricultural 
lands is threatened by an increasing use of chemical inputs and prac- 
tices that sterilize, structurally level, and standardize plots'*’*'?*— 
homogenizing and decimating biodiversity’. Here we provide evidence 
that a countryside biogeographic framework that is inclusive of human- 
made habitats and the opportunities it can afford to many species— 
given appropriate management of those habitats'*”*—better represents 
how, at least, bat biodiversity is responding in the Anthropocene. App- 
roaches for estimating biodiversity that explicitly account for human- 
made habitats*'’"'*?"”° are critical for predicting biodiversity change 
and avoiding its collapse in the agricultural landscapes that will increas- 
ingly dominate the Earth. 


METHODS SUMMARY 


Bats were sampled by using constant-effort mist netting. In the Costa Rican coun- 
tryside ecosystem, mist-netting protocols consisted of 20 ground-level mist nets 
(12 m X 2.5 m, 32-mm mesh) ina 3-5-ha plot haphazardly placed at each site, except 
the ‘supersite’ located in La Amistad International Park, which consisted of 60 
haphazardly placed mist nets within a 10-ha plot. Constant-effort mist netting for 
bats in the countryside ecosystem took place between 24 January and 28 March for 
four years (2009-2012). Sampling began at sunset and lasted 4h each night. Coun- 
tryside ecosystem sites were sampled on three nights each year from 2009 to 2011 
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constant associated with the area measurement, and z is the slope of the 
species-area relationship in log-log space). In contrast, bat species richness in 
countryside forest fragments generally had the same number of species as 
minimally altered forest habitat (b). Bat species richness in open habitats 
compared to forest habitat varied (b). Studies with enough information to 
calculate effect sizes demonstrated how some agricultural practices, such as 
agroforestry, support more bat biodiversity than more intensive agricultural 
practices (d; effect size direction indicates species richness decline or increase 
compared to forest habitat, error bars represent 95% confidence intervals). 
Inten., intensive. Country code abbreviations: CR, Costa Rica; CO, Columbia; 
ID, Indonesia; MX, Mexico. 


and on only one night in 2012, except the ‘supersite’ located in La Amistad Inter- 
national Park, which was sampled for three nights in 2009 and three nights in 
2010. In the Panamanian island ecosystem, mist-netting protocols consisted of six 
ground-level mist nets (6m X 2.5 m, 16-mm mesh) placed around the perimeter 
of a 0.5-ha plot at each site. Constant-effort mist netting for bats in the island eco- 
system took place between October 2003 and October 2005. Each site in the island 
ecosystem was sampled for seven or eight complete nights (that is, from sunset to 
sunrise) over a 2-year period. All captured bats in both ecosystems were identified 
and standard measurements were collected. Adult bats from larger species (more 
than 5-10 g) were marked individually. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


Costa Rican countryside ecosystem bat sampling. The countryside investigation 
was centred on the Las Cruces Biological Station and Wilson Botanical Garden 
located in the Coto Brus Valley in southwestern Costa Rica. The Las Cruces Bio- 
logical Reserve (8° 47' N, 82°57’ W, 1,100 m above sea level), which serves as a 
regional biological baseline, protects about 280 ha of primary and mature secondary 
forest (Fig. 2). The climate is characterized by a long rainy season (9-10 months) 
followed by a brief dry season, with an average annual rainfall of 3.4 m (ref. 31). 
Forests in the area are classified as premontane tropical wet forest”. More than 
60% of the Coto Brus Valley, once forested, has been converted into cropland and 
pasture since the 1950s (refs 33, 34). Despite relatively recent deforestation, sedi- 
ment core analyses reveal a robust, 3,000-year history of agriculture, forest clearing 
and fire in the region up to the sixteenth century”. 

Bats were sampled in seven general habitat types at 19 locations within 25 km of 
Las Cruces Biological Reserve (Fig. 2). All except one habitat type were represented 
by three sites located in (1) the Las Cruces Biological Reserve (three sites, located 
1.3, 1.8 and 1.8km apart from their nearest neighbouring site in the reserve), 
(2) small forest fragments (1-20 ha, three sites), (3) 10-20-year-old secondary forest 
located in large forest fragments (50-75 ha, three sites), (4) thin strips of riparian 
remnant forests (10-100 m wide, three sites), (5) coffee plantations with about 
25% local tree cover on farms (three sites) and (6) coffee plantations with about 5% 
local tree cover (three sites). Moreover, one ‘supersite’, or a site equivalent in sam- 
pling effort to three sites (that is, 60 mist nets were deployed, instead of the normal 
20 mist nets at the other 18 sites), operated in La Amistad International Park located 
about 25 km northeast of the Las Cruces Biological Reserve of the Organization for 
Tropical Studies. Bat sampling was conducted using constant-effort mist netting. 
Mist-netting protocols consisted of 20 ground-level mist nets (12 m X 2.5 m, 32-mm 
mesh) in a 3-5-ha plot haphazardly placed at each site, except the ‘supersite’ located 
in La Amistad International Park, which consisted of 60 haphazardly placed mist 
nets within a 10-ha plot. Constant-effort mist netting for bats took place between 
24 January and 28 March for four years (2009-2012). Sampling began at sunset and 
lasted for 4h each night. Sites were sampled on three nights each year from 2009 
to 2011 and only one night in 2012, except the ‘supersite’ located in La Amistad 
International Park, which was sampled for three nights in 2009 and three nights 
in 2010. 

Captured bats were identified and standard measurements were collected by the 

method of ref. 37. In 2009, bats were individually, but temporarily, marked by using 
wing punches’. From 2010-2012 most bats (species weighing more than 5 g, exclud- 
ing juveniles) were marked with collars fashioned from self-locking cable ties and 
individually numbered aluminium bands”. All bats were released on site shortly 
after identification, marking and measurement. All animals were handled and 
treated ethically within the guidelines of the Institutional Animal Care and Use 
Committee. 
Panamanian island ecosystem bat sampling. The island investigation was con- 
ducted on the human-made Gattin Lake (9° 11’ N, 79° 53’ W, 26 m above sea level) 
located in Central Panama. The 1914 damming of the Chagres River during con- 
struction of the Panama Canal created the lake and more than 200 islands ranging 
in size from less than 1 ha to the well-studied 1,560-ha Barro Colorado Island (BCI). 
Together with five adjacent mainland peninsulas, BCI forms the 5,400-ha Barro 
Colorado Nature Monument and is contiguous with 22,000-ha Soberania National 
Park. The climate is strongly seasonal, with a long rainy season followed by a 4-month 
dry season. Average annual rainfall is 2.1 m (ref. 38). Dry-season winds have affected 
exposed islands, resulting in lower forest stature and tree species diversity”. Forests 
in the area are classified as lowland tropical moist forest*. 

Bats were sampled at six sites on the mainland peninsulas and at eleven sites on 
islands of various sizes and distances from the mainland. Bat sampling was con- 
ducted with constant-effort mist netting. Mist-netting protocols consisted of six 
ground-level mist nets (6m X 2.5m, 16-mm mesh) placed around the perimeter 
ofa0.5-ha plot at each site. Constant-effort mist netting for bats took place between 
October 2003 and October 2005. Each site was sampled for seven or eight complete 
nights (that is, sunset to sunrise) over a 2-year period. Bats were also captured with 
a single, large wall net. Bats captured in the wall net were excluded from this study 
because comparable sampling of the forest canopy was not conducted in the Costa 
Rican countryside ecosystem. 

Captured bats were identified, and standard measurements and demographic 
data were collected as described in ref. 41. Most bats (species weighing more than 
10 g, excluding juveniles) were marked with individually numbered ball-chain neck- 
laces, or, in the case of larger species, with passive, subcutaneous transponders 
(EURO-ID). All bats were released on site shortly after marking and measurement. 

Detailed site descriptions and additional methods and results in the Panama- 
nian island ecosystem can be found in ref. 26. All animals were handled and treated 
ethically within the guidelines of the Institutional Animal Care and Use Committee. 


Landscape habitat classifications. To characterize landscape context in Costa 
Rica we conducted a manual habitat classification by digitizing aerial photographs. 
Manual habitat classification was favoured over automated approaches to facilitate 
the accurate identification of fine-scale landscape features in a complex country- 
side ecosystem”. Orthorectified aerial photographs used in the habitat classifica- 
tion were from Costa Rica Airborne Research and Technology Applications taken 
in 2003 and 2005 at 2 m spatial resolution. The habitat map covered 23,600 ha and 
included all sampling locations. The area was delineated to encompass all study 
locations sampled by Stanford University’s Center for Conservation Biology (see, 
for example, refs 33, 42-48). The four ecosystem elements digitized at 2 m resolu- 
tion over this extent included forest elements (primary and secondary forest of all 
sizes, single trees, charral (early secondary forest), large shrubs, live fences, hedge- 
rows, non-native timber and fruit tree plantations, and non-native garden orna- 
mentals; Fig. 1); agricultural plots (the majority being sun coffee plantations, but 
also including banana, plantain, tomato, and chili plantations); pasture; and houses, 
roads and other rural infrastructure. The final map was verified in the field from 
multiple vantage points, compared with previous studies in the landscape and checked 
with 150 random samples across the landscape. Forest elements from the manual 
habitat classification were used in analyses, whereas agricultural plots, pasture, houses, 
roads and other rural infrastructure were broadly treated as deforested habitat. 

To characterize landscape context in Panama we used the ‘BCI Basemap’ data 

set” to delineate shorelines and islands of Lake Gatun with the ‘BCI Landcover 
Map 2003’ data set® to confirm forest, water and agriculture cover. Both data sets 
were accessed in early 2013 online through the Smithsonian Tropical Research 
Institute GIS Data Portal (http://mapserver.stri.si.edu/geonetwork/). All islands 
were completely forested except for 1 ha on a 16-ha island. 
Landscape explanatory variable calculations. Several relevant landscape metrics 
were calculated for the Costa Rican countryside ecosystem and the Panamanian 
island ecosystem. The landscape metrics included measures that captured the related 
variables of habitat area and habitat isolation. We also examined the importance of 
spatial scale when measuring some of the landscape characteristics in this study. The 
specific landscape variables calculated and used in statistical modelling included 
fragment/island size, isolation from contiguous tropical forest, general habitat type, 
and total forest area within a range of spatial scales. We do not report effects of forest 
or island fragmentation in this study because measures of fragmentation are often 
confounded with area and merit a separate, in-depth analysis for clarity*’. 

First, we calculated forest fragment or island size in each ecosystem. In the Pana- 
manian island ecosystem each island was clearly distinct and the contiguous forest 
habitat on each island was easily measured as the total size of the island. The forest 
fragments in the countryside ecosystem of Costa Rica, however, were slightly more 
difficult to measure: forest fragments are not easily distinguished because of many 
fine-scale forest connections across the farming landscape” (Fig. 2). In the coun- 
tryside ecosystem nearly every forest fragment is connected to other forest rem- 
nants by live fences, riparian strips and fine-scale ribbons and clusters of trees that 
weave across ridges, valleys and property boundaries. In most cases these fine-scale 
forest connections are less than 20m in width. We therefore arbitrarily omitted 
forest element connections that were less than 20 m in width to calculate forest 
fragment size in the countryside ecosystem. 

Second, habitat isolation for the countryside and island ecosystems was mea- 
sured as the Euclidean distance from the centre of a site or mist net to the nearest 
edge of the Las Cruces Biological Reserve, La Amistad International Park or the 
nearest Panamanian mainland shore. Sites located inside reserves or mainland pen- 
insulas received negative values that measured their distances from reserve edges 
and mainland shores. 

Third, we included general and broad habitat types as discrete explanatory 
variables in our models. In the countryside ecosystem each site and net was labelled 
as reserve, forest fragment or coffee plantation. In the island ecosystem we assigned 
sites and nets into three categories: mainland interior, mainland edge or island, 
following ref. 26. Moreover, some categories were lumped during analysis to check 
for broader differences between islands and all mainland sites in the Panamanian 
island ecosystem or between forest and non-forest sites in the Costa Rican country- 
side ecosystem. 

Finally, we calculated the total forest area—contiguous or not—for each site and 
each mist net at multiple spatial scales. For example, the total amount of forest area 
at each site and mist net, including nearby forest fragments and neighbouring islands, 
was measured at radii ranging from 50 m to 1 km (measurements were calculated 
at 50-m increments). 

Selecting spatial scales for measuring forest area. Countryside ecosystems and 
island ecosystems differ substantially because of landscape context. Countryside 
ecosystems often contain high proportions of human-made habitat that, generally, 
includes considerable vegetative structure and, depending on the spatial scales used 
to view habitat characteristics, many smaller ‘patches’ of habitat—often embodied 
in the form of single trees or fallowed agricultural fields. The surrounding landscape 
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context in island ecosystems is strictly homogeneous, except for neighbouring 
islands and the mainland shore. In view of the substantial differences in landscape 
context and recently proposed ideas, specifically the habitat amount hypothesis"’, 
we champion total habitat area within an appropriately selected spatial scale as a 
more ecologically relevant measure of habitat. 

We used an established approach described in detail in ref.33 to compare and 

select appropriate spatial scales during analysis. All results were robust to forest 
area measured at different spatial scales. For clarity, two spatial scales were selected 
to measure forest area after a priori and post hoc analyses, and statistical modelling 
revealed that few differences in results were observed when forest area was mea- 
sured among a broad range of spatial scales. We chose spatial scales that emerged 
as strong predictors after we verified that forest area measured at all spatial scales 
generally agreed. Site-level analyses used forest area measured within a 1-km spatial 
scale, and net-level analyses used forest area measured within a 150-m spatial scale. 
We do not report the nuances observed between different spatial scales in this study, 
which merit a separate, in-depth analysis to determine their ecological relevance’’. 
Dimensions of bat biodiversity: species richness estimates. Bat captures in the 
countryside and island ecosystems were used to inform three dimensions of bio- 
diversity metrics at the resolution of a site or a mist net. The first dimension of bat 
biodiversity was an estimate of bat species richness at each site (that is, the estimated 
number of unique species at each site). Chao species richness estimates of bats 
were used because they perform well with small sample sizes and uneven sampling 
effort**. Chao species richness estimates were used in analyses except when the 
standard error surrounding the species richness estimate was undefined when 
calculated in R v.2.15.2 with the vegan package. In cases in which species richness 
estimates were obtuse, the observed species richness (that is, the observed number 
of unique species captured at a given site) was used in analysis. In all, four sites 
used observed species richness values (two Costa Rican coffee plantation sites, one 
Costa Rican forest fragment site and one Panamanian island site). Results were 
consistent when observed species richness was used, but the inclusion of Chao 
species richness estimates dealt with differences in sampling effort between studies 
in Costa Rica and Panama. 
Dimensions of bat biodiversity: species evenness estimates. The second dimen- 
sion of bat biodiversity is Pielou’s species evenness index; this was calculated for 
each site. Species evenness refers to how equal the assemblage of bat species is numer- 
ically, or how close in abundance species at a site are. Pielou’s species evenness 
index is based on the Shannon-Weaver diversity index (denoted by H?): 


s 
H'=—) 0 pilnp; (1) 
i=l 
In the Shannon-Weaver diversity index, p; is the proportion of species i at a site, 
and S is the number of species at a site. Pielou’s species evenness index, J, was then 
calculated for each site; it is defined as 
H’ 
=o 2 
[ice (2) 


Dimensions of bat biodiversity: Assemblage Abundance Shift Index. The third 
dimension of bat biodiversity in this study centred on changes in abundance-based 
assemblages of bats relative to bat assemblages found in the baseline habitats for 
each ecosystem. Bat assemblage abundance shifting was calculated for each mist 
net because field observations and preliminary analyses of bat assemblages in the 
countryside ecosystem suggested that differences in the bat assemblage existed 
between nets within the same site relative to a net’s proximity to nearby trees, coffee 
plantations and pastures. For example, patterns of assemblage differences emerged 
nightly at a given site when field assistants compared species and abundances of 
bats extracted from groups of nets closer to habitat transitions with those of bats 
captured in nets located uniformly in forested habitat or coffee plantations. 

The degree of assemblage abundance shift was based on an index we developed 
that compared a net’s abundance-based assemblage similarity to that of the set of 
nets located in the Las Cruces Biological Reserve and La Amistad International 
Park, in the countryside ecosystem, or the Panamanian mainland, in the island 
ecosystem. We used Chao abundance-based similarity coefficients to calculate the 
similarity of a net’s abundance-based assemblage to that of all the nets in each eco- 
system baseline. The Chao abundance-based similarity coefficient measures sim- 
ilarity between two samples, and implements a Jaccard-type index that accounts 
for unseen, shared species; it is defined as 


‘ UW 

abd(U,W) = >. 3 
al ) U+W-—UW (3) 
In equation (3), Vand W represent estimates of each site in such a way that they 
are adjusted for unseen shared species To account for the effect of unseen shared 
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species, the index uses frequencies of observed, rare, shared species to estimate an 
appropriate adjustment term that accounts for the unseen, shared species. The 
adjustment is achieved by using the following equations for each estimate: 


D: Di 

BS ‘AX; | (m—1) fer Xi 

U 4 + > I(Y;=1) 4 
<n m 2fyaan ( (4) 
Du D 

AY: (n—-) far Yi 

Ww | I(X;=1 5 
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In equations (4) and (5) each function accounts for unseen, shared species between 
two sites by using the following logic: ifa random sample of n individuals (sample 1) 
is taken from site 1 and a random sample of m individuals (sample 2) from site 2, 
and species frequencies in each sample are denoted as (Xj, X2,...,Xs1) and (Yj, 
Y>, ...» Ygo), respectively, with zero used where species were missing from the sam- 
ple. Then the pairs of frequencies for the S;2 species truly shared by the two assem- 
blages are (X,, Y;)(X2, Yo) ... (Xs12, Ysi2). Assume that Dy of the S,, shared species 
available are actually observed in both samples and that their frequencies are the 
first D,, pairs. Thus, an additional S;, — D,, species are shared by the two assem- 
blages but absent from one or both samples. 

In equations (4) and (5) the indicator function I(expression) is defined such that 
I= 1 ifthe expression is true, and I = 0 if false. Let fi 4. = ee 1[X;=1,Y;> 1] be 
the observed number of shared species that are singletons (X; = 1) in sample 1 
(these species must be present in sample 2, but may have any abundance). Now, let 
fo+ represent the observed number of shared species that are doubletons (X; = 2) 
in sample 1. Similarly, f,.; and f,.2 are the observed number of shared species that 
are, respectively, singletons (Y;=1) and doubletons (Y;=2) in sample 2. If 
f+2 =0 or fo4 = 0, replace f,. and f2+ in the denominators by f,. + 1 or fi4 + 
1, respectively. If U or W is greater than 1, then it is replaced by 1. See ref. 53 for 
more details, explanations and derivation of equations. 

The Chao abundance-based similarity coefficients were calculated for each net 
(denoted as k) with all nets in the Las Cruces Biological Reserve and La Amistad 
International Park in the countryside ecosystem, or the mainland in the Pana- 
manian island ecosystem (denoted as b;), using variations of equations (3) and (4) 
in R v.2.15.2 with the vegan package. The arithmetic means of all Chao abundance- 
based similarity coefficients for each net with each baseline sample were calculated 
(denoted as b): 


M,= - 5 jabal( kb) (6) 


i=1 
The mean Chao abundance-based similarity coefficients for each baseline were 
also calculated relative to all baseline nets for each ecosystem. Finally, we standar- 
dized all values by dividing each M, value by the maximum M, value in each 
ecosystem (M,max): 
Mp 
7 Mymax 


(7) 


The resulting values for each net (that is, M) are between 0 and 1, and represent the 
abundance-based assemblage similarity of a single net to all the nets located in the 
corresponding ecosystem baseline, while accounting for detection biases of unseen, 
shared species. Therefore, the closer a net’s value is to 1 in the countryside eco- 
system, the more similar that net is in bat species composition and abundances to 
those of all the nets located inside the Las Cruces Biological Reserve and La Amistad 
International Park. Similarly, the closer a net’s value is to 0 in the island ecosystem, 
the less similar that net is in bat species composition and abundances to those of all 
the nets located on the Panamanian mainland. 

As calculated in equations (3)-(7), the Abundance-Based Assemblage Simila- 
rity Index (that is, M) is dependent on changes in species richness and abundances. 
The Abundance-Based Assemblage Similarity Index was therefore modified to 
account for species richness. To account for the effect of species richness in the 
Abundance-Based Assemblage Similarity Index we performed a regression between 
that index and the observed species richness values for each net and calculated the 
residuals from the best-fit model (Extended Data Fig. 2). Nets with five or fewer 
total captures were excluded from the Assemblage Shift Index (21 nets from the 
countryside ecosystem and zero nets from the island ecosystem). Positive residuals 
indicate a greater similarity than expected in assemblage abundances of a net to 
those of all nets in the corresponding baseline. Negative residuals indicate greater 
differences in the assemblage abundances than expected. 

In its final form, the Assemblage Abundance Shift Index was independent of 
observed species richness, estimated species richness, species evenness, and total 
bat abundance. The Assemblage Abundance Shift Index is a holistic measure of how 
the bat assemblage changes in abundances across space, independently of changes 
in species richness. 
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Statistical analyses. All statistical analyses were conducted ina GLMM (generalized 
linear mixed effects models) framework”. Analyses were conducted in R v.2.15.2 
(R Development Core Team 2012) and relied extensively on vegan, lme4, nlme and 
AlCcmodavg packages. All variables were tested for normality to meet assump- 
tions of statistical tests. 

In our first study contrasting a Costa Rican countryside ecosystem and an island 
ecosystem, we conducted three suites of analyses for each dimension of bat biodi- 
versity (that is, species richness estimates, species evenness estimates, and the Assem- 
blage Abundance Shift Index). Each set of analyses was designed to identify the 
effects of ecosystem change on bat biodiversity—specifically concentrating on fragment/ 
island size, isolation from contiguous tropical forest, general habitat type, and total 
forest area within a wide range of spatial scales (see the section above on selecting 
spatial scales for measuring forest area). All models avoided collinearity of explan- 
atory variables by disqualifying any linear models that used multiple dependent 
variables (for example, fragment or island size and total forest area are collinear 
and were therefore not used in the same statistical model to explain any of the bat 
biodiversity response variables). 

At the onset of each suite of analyses we explicitly tested for interaction terms 
between landscape variables and the ecosystem type (that is, countryside ecosys- 
tem versus island ecosystem). The logic behind testing for interaction terms stems 
from the fact that the countryside ecosystem and the island ecosystem are separated 
by about 350 km; they have slightly different species pools and therefore different 
intercepts within a linear modelling framework. We therefore tested statistically 
for differences between ecosystems through differences in slopes of functions by 
using interaction terms in our modelling, rather than only looking for differences 
in the intercept value. 

In each suite of analyses we first tested for differences between the countryside 
ecosystem and the island ecosystem and restricted our analysis to forested habitats 
by excluding data from coffee plantations (Extended Data Tables 1 and 2). After 
differences were confirmed between the two ecosystem types, the second suite of 
analyses included data from coffee plantations and focused on fitting the best model 
and selecting the best landscape variable or variables for predicting the bat biodi- 
versity response variable of interest (for example estimated species richness, esti- 
mated species evenness or the Assemblage Abundance Shift Index). 

After differences between ecosystems were tested, each ecosystem was modelled 
independently (that is, analyses were performed with data from only a single eco- 
system, rather than including a factor within each model that denoted ecosystem 
type as countryside ecosystem or island ecosystem). Nonlinear functions were included 
in ecosystem-specific analyses, and the most relevant models are presented in 
Extended Data Tables 3 and 4. 

In all analyses, models were compared by using the corrected Akaike informa- 
tion criterion (AIC,) and maximum likelihood. Because landscape forest areas mea- 
sured at multiple spatial scales were autocorrelated and collinear, we identified a 
single appropriate spatial scale by comparing models using AIC, and maximum 
likelihood. Little differences in AIC, values were observed between landscape 
variables measured at different spatial scales. 

In the GLMM framework we made use of spatial correlation structures to avoid 
issues of spatial autocorrelation*’. The UTM (Universal Transverse Mercator) coor- 
dinates of each site or net, measured as the centre of each site or net, were used to 
account for spatial autocorrelation between sites and nets. Models were compared 
that used a variety of correlation structures, including ‘corSpher’, ‘corLin’, ‘corRatio’, 
‘corGaus’ and ‘corExp’ (see pp. 161-170 of ref. 54 for more information). The 
“‘corRatio’ correlation structure, or the ratio correlation structure, consistently per- 
formed best in all models and in all analyses. All analyses make use of the ratio cor- 
relation structure with a ‘nugget’. Moreover, site identity was included as a fixed, 
random effect in analyses conducted at the net level. 

The three suites of analyses focused on identifying how the response variables 
of, respectively, bat species richness estimates, bat species evenness estimates, and 
the bat Assemblage Abundance Shift Index respond to fragment or island size, 
isolation from contiguous tropical forest, general habitat type, and total forest area 
within a range of spatial scales. The model comparisons for each suite of analyses 
are presented in Extended Data Tables 1-4. Results from models that included 
forest area measured at multiple spatial scales are not presented for clarity, but few 
differences between models were observed (see the section above on selecting spa- 
tial scales for measuring forest area). 

Meta-analysis of studies of bat biodiversity across countryside and island eco- 
systems worldwide. We searched for all relevant research articles from any time 
period by using the following Web of Science search query: Title = (bat) AND 


Topic = (conservation and fragment”), Timespan = All years, Search language = 
English. Moreover, we asked authors of many of the research articles we reviewed 
to ensure that our list was representative—especially authors whose studies were 
conducted outside the Neotropics (Fig. 4a). From this list we reviewed articles and 
retained studies that included comparisons between habitats, studies that sampled 
a minimum of ten different bat species, and studies with replicate sampling sites 
and similar sampling effort in each habitat type. 

For studies that met our criteria, we compared species richness between habitat 
types in each study. Habitat types in each study fell into the following three broad 
categories: minimally altered reference habitats (for example forest reserves or con- 
tiguous mainland forest); fragmented or isolated reference habitat (for example 
forest fragments and natural or human-made, true islands); or human-made or 
naturally open habitat (for example cropland, pasture and, in one study, African 
savanna). When sufficient data were available, the bias-corrected Hedge’s g* was 
calculated and used to determine whether bat species richness differed between 
habitat types. Reported statistical differences in species richness were used in 
studies if there were insufficient data to calculate Hedge’s g*. 

Wide arrays of metrics were used across studies to report bat assemblage shifts 
between habitat types. Because it is difficult to compare multiple measures of bat 
assemblage shift between studies, we chose to report assemblage shifts in each 
study as observed or unobserved. The detailed list of studies used is presented in 
Supplementary Data. 
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Extended Data Figure 1 | Bats use a variety of habitats in the countryside _ coffee plantations (yellow).The proportions of individuals captured in smaller 
ecosystem. Shown are the proportions of captured individuals from 30 species _ forest fragments of various sizes are also shown. Total numbers of individuals 
in different countryside habitats. Forest dependence rank ranges from forest _ per species are listed parenthetically after abbreviated species names. A total 
avoidance (left side of x axis) to reserve dependent (right side of x axis) and was _ of 4,424 individuals are represented. 

determined by comparing relative abundance in reserves (green) with that in 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


Countryside Ecosystem Island Ecosystem 


ALas Cruces Forest 
@Forest fragment 
© Coffee plantation 


Abundance-Based Assemblage Similarity Index 


A Mainland 
Edge 
@ Island 
0 5 10 15 20 0 5 10 15 20 
Species richness Species richness 


Extended Data Figure 2 | Assemblage Abundance Shift Index based on 
ordination analyses of bat abundances and how they collectively shift 
relative to bat abundances in minimally altered habitat. The plots 
demonstrate how the Assemblage Abundance Shift Index accounted for 
changes in species richness to focus on predicting changes in assemblage-level 
shifts in abundances between habitats. In both ecosystems regression analyses 
favoured logarithmic relationships between the abundance-based assemblage 
similarity of the bats captured in a net relative to the reserve or mainland nets 
and the observed species richness of the bats captured in the net (see Methods). 
Logarithmic models (solid lines) outperformed linear models in model 
comparisons (countryside ecosystem AAIC. = 22.75; island ecosystem 
AAIC, = 5.92). For each ecosystem, logarithmic models were used to calculate 
the residual assemblage shift for each net that was not explained by changes in 
species richness but by changes in the abundances of species. The residuals 
are therefore an index of assemblage abundance shifting after accounting for 
changes in species richness. Regression coefficients and statistics are described 
in Extended Data Table 3. 
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Extended Data Table 1 | Summary of model performances of forest habitat comparisons in an island ecosystem and a countryside ecosystem 


total forest area within 1 km [%] * ecosystem type [factor 7 176.58 0.00 0.59 0.59 | -78.75 


general habitat type [factors: island, mainland edge, mainland interior, forest fragment, reserve] 


2.20 


0.20 


0.79 


+ isolation distance [km] * ecosystem type [factor] 


178.78 
8 | 180.56 


3.98 


0.08 


0.87 


total forest area within 1 km [%] + ecosystem type [factor, 6 181.33 4.75 0.06 0.93 | -82.84 


log(fragmentisland size [ha]) * ecosystem type [factor] 


0.04 


total forest area within 1 km [%] 


0.01 


log(fragmentisland size [ha]) + ecosystem type [factor] 


0.04 


log(fragmentiisland size [ha]) + isolation distance [km] + ecosystem type [factor 
isolation distance [km] * ecosystem type [factor] 


10.53 


0.00 
0.00 


1.00 


isolation distance [km] + ecosystem type [factor] 


7 
5 
6 
7 
7 
6 


10.90 


0.00 


1.00 


log(fragmentiisland size [ha 
log(fragmentlisland size [ha]) + isolation distance [km] 


187.11 
187.48 
5 | 188.58 


12.00 
14.45 


0.00 
0.00 


1.00 
1.00 


Null [intercept 
isolation distance [km] 


6 | 191.03 
4 | 195.35 


18.77 


0.00 


1.00 


~“N 


isolation distance [km] * ecosystem type [factor -36.47 0.00 0.97 0.97 27.78 


isolation distance [km] + ecosystem type [factor] 


total forest area within 1 km [%] + ecosystem type [factor] 


log(fragmentisland size [ha]) + isolation distance [km] + ecosystem type [factor] 


Null [intercept] 
general habitat type [factors: island, mainland edge, mainland interior, forest fragment, reserve] 


14.37 


0.00 


1.00 


22.48 


log(fragmentlisland size [ha]) + ecosystem type [factor] 


OD] Co} | NI] |] 


16.85 


0.00 


1.00 


17.63 


total forest area within 1 km [%] * ecosystem type [factor] 
log(fragmentiisland size [ha]) 


-22.09 
-19.61 
-19.51 


N 


16.96 
17.04 


0.00 
0.00 


1.00 
1.00 


19.30 
15.97 


total forest area within 1 km [% 
log(fragmentlisland size [ha]) + isolation distance [km] 


-19.43 
-19.36 


ayn 


17.11 
18.15 


0.00 
0.00 


1.00 
1.00 


15.93 
16.98 


log(fragment/island size [ha]) + isolation distance [km] * ecosystem type [factor] 
log(fragmentiisland size [ha]) * ecosystem type [factor 
isolation distance [km] 


co] a> 


-18.31 
-16.96 


NI 


19.50 


0.00 


1.00 


19.91 


1973 |__0.00| 1.00 | 17.91 


a 


6.67 


43.13 


0.00 


1.00 


2.92 


total forest area within 150 m [%] + ecosystem type [factor 


log(fragmentisland size [ha]) * ecosystem type [factor] 


~“ 


2.28 


0.20 


0.81 


-822.21 0.00 0.61 0.61 | 418.30 


418.22 


total forest area within 150 m [%] 


2.96 


0.14 


0.95 


415.77 


log(fragmentlisland size [ha]) + ecosystem type [factor 
total forest area within 150 m [%] * ecosystem type [factor] 


—™I} >| Co 


-819.93 
-819.25 
814.46 


7.15 
“17 


0.01 
0.01 


0.96 
0.98 


414.43 
415.47 


log(fragmentisland size [ha]) + isolation distance [km] + ecosystem type [factor] 


8.18 


0.01 


0.99 


415.27 


log(fragment/island size [ha]) + isolation distance [km] * ecosystem 


log(fragmentiisland size [ha] -811.89 10.32 0.00 0.99 | 412.09 


general habitat type [factors: island, mainland edge, mainland interior, forest fragment, reserve 
isolation distance [km] + ecosystem type [factor] 


814.44 
814.03 
812.19 


CO} G>| CO} Co} Co 


10.02 


0.00 


0.99 


415.41 


810.56 | 11.65 0.00 1.00 | 414.60 


11.85 


0.00 


1.00 


412.38 


log(fragment/island size [ha]) + isolation distance [km] 


-810.36 
-809.93 


NN 


12.28 


0.00 


1.00 


412.16 


isolation distance [km] * ecosystem type [factor -808.51 13.70 0.00 1.00 | 412.51 


Null [intercept] 


foe) 


-805.75 


16.46 


0.00 


1.00 


407.98 


isolation distance [km] 


Oo 


-805.35 


16.86 


0.00 


1.00 


408.82 


The summary shows comparisons of forest fragments embedded in a human-dominated landscape and true forest islands surrounded by water; see Extended Data Table 3 for analyses with deforested habitat 
matrix sampling included. Listed are relevant outputs summarizing the performances of generalized linear mixed effects models predicting estimated bat species richness (green heading), bat species evenness 
(orange heading) and the Species Assemblage Abundance Shift Index (purple heading; see Methods), using a variety of landscape metrics as explanatory variables. Bat biodiversity and landscape data were 
gathered from a Costa Rican countryside ecosystem and a Panamanian island ecosystem (that is, ecosystem type). All models are generalized linear mixed-effects models with Gaussian error structures and 
identity links. In addition, all models include a ‘corRatio’ correlation structure based on site or net coordinates (UTM). Net-level analyses included site identity as a random effect in all models. 
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Extended Data Table 2 | Regression coefficients and relevant statistics generated from best-fit models from Extended Data Table 1 


intercept 30 6.048 0.000 
total forest area within 1 km [%] 30 5.225 0.000 
ecosystem type [factor 30 4.906 0.000 


total forest area within 1 km * ecosystem type 30 -2.931 0.007 


intercept 30 15.948 0.000 


isolation distance [km] 30 -4.672 0.000 
ecosystem type [factor] 30 4.395 0.000 
isolation distance * ecosystem type 30 3.960 0.001 


Intercept 262 -2.641 0.009 
total forest area within 150 m [%] 262 2.979 0.003 
ecosystem type [factor] 28 2.279 0.031 


The models explain relationships between three dimensions of bat biodiversity and the remaining landscape variables after model optimization for forest habitats in an island ecosystem and a countryside 
ecosystem. Model performances and comparisons are listed in Extended Data Table 1. Summarized models are in bold in Extended Data Table 1. After model selection, best-fit models were rerun and are 
presented here using restricted maximum likelihood®*. Data from the deforested habitat in the countryside ecosystem (that is, sun coffee plantations) are not included in these analyses. In-depth ecosystem- 
specific analyses that include the deforested habitat in the countryside ecosystem are presented in Extended Data Tables 4 and 5. 
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Extended Data Table 3 | Summary of model performances of ecosystem-specific models 


broad habitat type [factors: coffee plantation, forest | 5 |] 113.93 | 0.00 0.49 0.49 | -49.66 


general habitat type [factors: coffee plantation, forest fragment, reserve] 


6 
total forest area within 1 km [%] 5 
isolation distance [km] 5 
4 
5 


Null [intercept] . . : 
log(fragment size [ha]) 121.10 7.16 0.01 1.00 
log(fragment size [ha]) + isolation distance [km] 6 123.92 9.99 0.00 1.00 


total forest area within 1 km [%] 0.00 0.87 0.87 
broad habitat type [factors: coffee plantation, forest] 5.71 0.05 0.92 
log(island size [ha 6.09 0.04 0.97 
general habitat type [factors: coffee plantation, forest fragment, reserve] 7A7 0.02 0.99 
log(island size [ha]) + isolation distance [km] 9.31 0.01 1.00 
isolation distance [km 12.65 0.00 1.00 


ull [intercept 67 : . ‘ -51.67 
Null [i ] 4 114.6 14.06 0.00 1.00 1.6 


sos[ Ute] —ose 300 
isolation distance [km 5 -50.19 3.05 0.12 0.69 | 32.40 
broad habitat type |factors: coffee plantation, forest] 3.65 0.09 0.78 | 32.10 
total forest area within 1 km [%] 5 -49.49 3.75 0.09 0.86 32.05 
general habitat type [factors: coffee plantation, forest fragment, reserve] 5.66 0.03 0.98 | 33.29 

6 46.12 


log(fragment size [ha]) + isolation distance [km] 7.12 0.02 1.00 32.56 


isolation distance [km] -7.99 0.00 0.77 0.77 | 11.72 
log(island size [ha]) + isolation distance [km] 471 3.27 0.15 0.92 12.56 
total forest area within 1 km [%] 

general habitat type [factors: coffee plantation, forest fragment, reserve] 

broad habitat type [factors: coffee plantation, forest] 


In(total forest within 150 m [%]) 0.00 0.91 0.91 | 443.59 
total forest area within 150 m [%] -870.21 4.67 0.09 1.00 | 441.26 
log(fragment size [ha] 847.65 | 27.23 0.00 1.00 | 429.98 
log(fragment size [ha]) + isolation distance [km] 28.86 0.00 1.00 | 430.22 
isolation distance [km -838.98 | 35.90 0.00 1.00 | 425.64 


log(island size [ha]) 6 | -241.10 0.00 0.37 0.37 | 126.99 
broad habitat type [factors: island, mainland] 6 | -240.48 0.62 0.27 0.64 | 126.68 
total forest area within 350 m [%] 6 | -239.24 1.86 0.14 0.78 | 126.06 
general habitat type [factors: island, mainland edge, mainland interior] 7 | -238.17 2.93 0.08 0.98 | 126.68 
isolation distance [km] 6 | -233.82 7.28 0.01 0.99 | 123.35 


5 | -23353 | 7.57 | 0.01 | 1.00 | 122.08 


The models used were those predicting estimated bat species richness (green heading), bat species evenness (orange heading) and the Assemblage Abundance Shift Index (purple heading), using a series of 
landscape metrics as explanatory variables. All models are generalized linear mixed-effects models with Gaussian error structures and identity links. In addition, all models include a ‘corRatio’ correlation structure 
based on each site’s or net’s UTM coordinates. Net-level analyses included site identity as a random effect in all models. 
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Extended Data Table 4 | Regression coefficients and relevant statistics generated from best-fit models from Extended Data Table 3 


(Intercept) 18.389 1.425 12.900 0.000 


broad habitat type (forest) [factors: coffee plantation, forest] 6.235 1.723 3.618 0.002 


Intercept 18.488 1.351 13.682 0 


habitat type (forest fragment) [factors: coffee plantation, 
forest fragment, reserve 4.997 1.698 2.943 0.01 


habitat type (reserve) [factors: coffee plantation, forest 
fragment, reserve 9.016 2.062 4.373 0.001 


Intercept 8.953 1.309 6.837 0.000 


total forest area within 1 km [%] 18.680 3.202 5.834 0.000 
(Intercept) 9.814 1.490 6.586 0.000 
log(island size [hal) 2.623 0.572 4.582 0.000 


17.304 
-2.612 


(Intercept) 0.575 0.045 12.758 0.000 
isolation distance [km] -0.114 0.031 -3,738 0.002 


(Intercept) 


In(total forest area within 150 m [%]) 


0.030 | 0.012 | 2.557 | 0.012 
broad habitat type (island) [factors: island, mainland] -0.046 0.015 -3.169 0.006 


*The second most competitive model is presented for predicting the Assemblage Abundance Shift Index in 
the island ecosystem. Island size and broad habitat types in the island ecosystem (i.e., islands and 
mainland) are confounded. Moreover, island size did not explain the community shift index when mainland 
sites were excluded from analysis (regression statistics for islands only: t= 0.561, P= 0.588, N = 66). 
Therefore, we concluded that island size is not a better predictor of community shift than broad habitat type. 
Model coefficients and statistics correspond with the best-fit lines in panel c of Figure 3 in the main text. 


The models explain ecosystem-specific relationships between three dimensions of bat biodiversity and the remaining landscape variables after model optimization. Model performance and comparisons are listed 
in Extended Data Tables 1-3, in which the summarized models are identifiable in the list by bold text. After model selection, best-fit models were rerun and are presented here using restricted maximum likelihood. 
Additional, less competitive models using island size and isolation distance to predict estimated species richness are also presented for the island ecosystem because of their relevance to island biogeographic 
theory. 
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Extended Data Table 5 | Regression coefficients and relevant statistics generated from best-fit models accounting for species richness in the 
Abundance-Based Assemblage Similarity Index 


(intercept) 11.868 | 0.000 
| In(observed number of species) 0.158 0.019 8.381 | 0.000 


The models explain relationships between the Abundance-Based Assemblage Similarity Index and observed species richness for each net (see Methods). Relationships are plotted in Extended Data Fig. 2 and were 
used to calculate the Assemblage Abundance Shift Index used throughout analyses and results. 
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Extended Data Table 6 | Observed diurnal roosts in deforested habitats in the countryside ecosystem 


Offspring 


Species Common name Roost structure 
observed 


Glossophaga commissarisit Commissaris’ long-tongued bat | Abandoned building on farm Yes 


Glossophaga soricina*tt Common long-tongued bat Abandoned building on farm Yes 


Carollia castaneat Chestnut short-tailed bat Abandoned building on farm Yes 


Carollia perspicillata* tt Seba’s short-tailed bat Abandoned building on farm Yes 


Carollia sowellit Sowell’s short-tailed bat Abandoned building on farm Yes 


Live fence of “Cafia India” or 
Dracaena fragrans 
Ornamental, non-native 
palm tree on farm 
Live fence of “Cafia India’ or 
Dracaena fragrans 
Heliconia pendula planted 
near house 


Artibeus jamaicensis* Jamaican fruit-eating bat No 


Artibeus lituratus Great fruit-eating bat 


Uroderma bilobatum*tt Common tent-making bat 


Thyroptera tricolor Spix’s disk-winged bat 


* Indicates that roost was found by diurnal location of individuals fitted with radio telemetry transponders. 
t Indicates that individuals fitted with radio telemetry transponders were originally captured inside forest 
fragments, but roosted in deforested habitat. 

t Indicates that roost was used by the same species at least 12 months after initial discovery. 


Casual observations of bat roosts were recorded during bat sampling and related field work in Costa Rica. Some roosts were located using 0.25-g radio telemetry transponders, temporarily attached to some 
animals in a related study. Radio transponders were temporarily attached using techniques adapted from birds*®. All roosts were found between 24 January and 28 March from 2009 to 2012 and were checked 
occasionally for use and offspring. The only species whose roost was not used over multiple days was Thyroptera tricolor. All roosts hosted multiple individuals of the species listed. 
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Consequences of biodiversity loss for litter 
decomposition across biomes 


I. Tanya Handa’, Rien Aerts’, Frank Berendse’, Matty P. Berg”, 
Mark O. Gessner?’®!0"!, Jeremy Jabiol®?, Marika Makkonen*”’, 


Andreas Bruder®’®, Olaf Butenschoen’, Eric Chauvet*”’, 
Brendan G. McKie“, Bjorn Malmqvistt, 


Edwin T. H. M. Peeters!®, Stefan Scheu’, Bernhard Schmid?°, Jasper van Ruijven*, Veronique C. A. Vos* & Stephan Hattenschwiler! 


The decomposition of dead organic matter is a major determinant 
of carbon and nutrient cycling in ecosystems, and of carbon fluxes 
between the biosphere and the atmosphere’*. Decomposition is 
driven by a vast diversity of organisms that are structured in com- 
plex food webs”. Identifying the mechanisms underlying the effects 
of biodiversity on decomposition is critical** given the rapid loss of 
species worldwide and the effects of this loss on human well-being’ ’. 
Yet despite comprehensive syntheses of studies on how biodiversity 
affects litter decomposition**”, key questions remain, including when, 
where and how biodiversity has a role and whether general patterns 
and mechanisms occur across ecosystems and different functional 
types of organism*”* ”. Here, in field experiments across five terrest- 
rial and aquatic locations, ranging from the subarctic to the tropics, 
we show that reducing the functional diversity of decomposer organ- 
isms and plant litter types slowed the cycling of litter carbon and nitro- 
gen. Moreover, we found evidence of nitrogen transfer from the litter 
of nitrogen-fixing plants to that of rapidly decomposing plants, but 
not between other plant functional types, highlighting that specific 
interactions in litter mixtures control carbon and nitrogen cycling 
during decomposition. The emergence of this general mechanism and 
the coherence of patterns across contrasting terrestrial and aquatic 
ecosystems suggest that biodiversity loss has consistent consequences 
for litter decomposition and the cycling of major elements on broad 
spatial scales. 

Biological diversity that directly influences litter decomposition exists 
at multiple trophic levels*. This diversity includes plants that produce 
litter mixtures of varying quality, microbial decomposers and inverteb- 
rate consumers of varying body size, the last two of which selectively 
use the heterogeneous resources provided by litter mixtures*"*. General 
principles of the effects of biodiversity on litter decomposition have proved 
elusive: both pioneering work"* and recent syntheses have highlighted 
contrasting effects of litter species richness on decomposition* *’*'*. In 
part, this variation appears to be due to site-specific conditions, includ- 
ing contrasts between aquatic and terrestrial ecosystems, as well as bet- 
ween geographic settings. Further differences may arise from variation 
in experimental protocols, the plant species studied and the types of de- 
composers included in a given experiment. Such methodological discrep- 
ancies have complicated syntheses across studies, hindering the emergence 
of common patterns and mechanisms. 

Here we report the results of the first concerted experiments study- 
ing the effects of biodiversity on decomposition by manipulating diver- 
sity across trophic levels and distinct biomes in both forest floor and 


stream habitats (Extended Data Table 1). We proposed that the functional 
diversity of decomposers (variation in body size) and of leaf litter (vari- 
ation in litter quality) promotes C and N cycling across contrasting loca- 
tions (subarctic to tropical) and ecosystem types (terrestrial versus aquatic). 
Body size encapsulates numerous species traits that are relevant to eco- 
system functioning, and extinction scenarios project that the larger spe- 
cies will be preferentially lost from biological communities'”"*. Similarly, 
plant functional types reflect differences in leaf quality traits that deter- 
mine litter decomposition independently of geographical location’’. Plant 
functional types are defined here in terms of plant C allocation strat- 
egies (deciduous versus evergreen), N acquisition strategies (N-fixing 
versus non-N-fixing) and litter recalcitrance (rapidly decomposing ver- 
sus slowly decomposing) (Extended Data Table 2). 

Mixing leaflitter from various plant functional types together resulted 
in accelerated C and N dynamics, as indicated by the overall net positive 
effects on C and N loss (that is, increased C and N loss with increasing 
functional diversity) (Fig. 1, C loss; Extended Data Fig. 1, N loss; P< 
0.05, C and N loss). However, C loss from litter mixtures was only 
2.9+0.8mgg ‘(mean ~ s.e.m.) of initial litter dry mass greater than 
the expected loss based on data from single litter functional types, in- 
dicating only a modest increase in C cycling as a result of litter mixing. 
Although also statistically significant (P < 0.01), the difference in the 
loss of N across all litter mixtures was very small (0.1 + 0.2mgNg ' 
of initial litter dry mass; mean + s.e.m.; Extended Data Fig. 1). The net 
litter diversity effect on C loss was stronger in terrestrial than in aquatic 
ecosystems (P < 0.001, Fig. 1 and Extended Data Table 3), supporting 
theoretical predictions* but contrasting the results of a meta-analysis in 
which diversity effects on decomposition were significant only for streams®. 
Sorting the litter mixtures into species at the end of the experiments 
enabled us to explore potential reasons for this discrepancy, by parti- 
tioning the net diversity effects into complementarity effects (that is, 
the effects resulting from synergistic or antagonistic interactions) and 
selection effects (that is, the effects arising when the presence of a par- 
ticular functional type with high (or low) process rates dominates a 
mixture)”°. The observed net diversity effects were clearly driven by com- 
plementarity effects that were stronger than selection effects (Fig. 1). 
Overall, the complementarity effect was a similar strength to the net 
effect for C loss (3.4 + 0.9 mg C g_'), and even stronger than the net 
effect for N loss (1.0 + 0.2 mg N g_'). By contrast, the mean selection 
effects were not significant. The characteristics of the forest floor habitat 
that may favour complementarity effects include strong fluctuations in 
temperature and humidity and a homogenous litter cover*. Conversely, 
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Figure 1 | Net diversity, complementarity and selection effects of plant litter 
mixtures on C loss. The net diversity effect is the deviation from the expected 
mean based on C loss measured from litter consisting of single species. The 
blue and brown circles show the mean effects (+s.e.m.) on C loss from litter 
mixtures in forest streams and on forest floors, respectively, in subarctic (SUB), 
boreal (BOR), temperate (TEM), Mediterranean (MED) and tropical (TRO) 
locations. Each circle to the right of the dashed lines shows the mean effect per 
ecosystem type (that is, aquatic versus terrestrial), as calculated across the three 
types of decomposer community (n = 165 litter mixtures per location and 
ecosystem type; see Extended Data Table 3 for statistical analyses). The circles 
to the left of the dashed lines show the overall mean across all locations. 


the observed negative complementarity effects in subarctic and tropical 
streams could reflect a low density and low taxon richness of litter con- 
sumers (Extended Data Table 4) and thus limited potential for comple- 
mentary resource use”’. 

Our experiments also show that completeness of the decomposer com- 
munity, which is rarely considered in large-scale studies, is important 
for C and N dynamics during litter decomposition (Fig. 2, Table 1 and 
Extended Data Table 5). The presence of medium-sized invertebrates 
(=1 mm in diameter) in the decomposer community increased the aver- 
age C and N loss across all sites by 2.1 + 0.8% and 2.0 + 1.0%, respect- 
ively. The complete decomposer community (which included organisms 
up to 5 mm in diameter) increased the average C loss across all sites by 
10.6 + 1.0% and the average N loss across all sites by 11.1 + 1.2% (Fig. 2). 
This effect was consistently positive across all but the Mediterranean 
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Figure 2 | Effect of decomposer community completeness on litter C and N 
loss. C loss (left) and N loss (right) from all litter treatments (all single species 
and all mixtures) exposed to medium-sized decomposers (top; percentage 
difference compared with the smallest mesh size) and the complete decomposer 
community (bottom; percentage difference compared with the smallest 

mesh size). The blue and brown bars show mean effects (+s.e.m.) in forest 
streams and on forest floors, respectively, in the five indicated locations (n = 45 
litter treatments per location per ecosystem type; see Table 1 for statistical 
analyses). 


terrestrial site. Thus, the presence of large fauna clearly has a major 
impact on decomposition (Table 1), as reported previously”; how- 
ever, in line with previous studies, the importance of large fauna varies 
among locations in aquatic” and terrestrial ecosystems”. In our study, 
the strong effects of the complete decomposer community at the tem- 
perate and tropical locations correspond to high relative abundances 
of millipedes and termites at the terrestrial temperate and tropical sites, 
respectively (Extended Data Table 6). Similarly, the large effect of the 
complete decomposer community at the temperate aquatic site corre- 
sponds to the high abundance ofa particularly efficient amphipod detri- 
tivore (Extended Data Table 4). Our data clearly indicate that the large- 
bodied organisms are the most critical for decomposition. These animals 
also tend to face the greatest extinction risk’’. 

Litter mixing and completeness of the decomposer community inter- 
acted with each other to affect C and N loss, although this interaction 
explains less of the variance than the main effects (Table 1). C loss and, 
even more so, N loss increased in the presence of particular plant func- 
tional types and with increasing completeness of the decomposer com- 
munity (Table 1 and Extended Data Table 7). Although the type of 
decomposer community did not significantly change the net effect of 
diversity on Closs (P = 0.67) or N loss (P = 0.30) (Extended Data Table 3), 
it emerged as a significant factor in the selection effect for both C loss 
(P< 0.05) and N loss (P < 0.05). Additionally, the interaction between 
the rapidly decomposing litter type and the decomposer community was 
significant in explaining the selection effect and the overall net diversity 
effect on C loss (P < 0.05) and N loss (P < 0.05), suggesting that large 
decomposers are particularly important drivers of C and N loss from 
litter mixtures that contain rapidly decomposing litter. The food pre- 
ference behaviour of decomposers could be important in accounting 
for this result, as has previously been implied for terrestrial” and aquatic”® 
ecosystems. 

A key result of our large-scale study is that the effects of litter diver- 
sity on Cand N dynamics can be largely explained by the presence of par- 
ticular functional plant types in litter mixtures, supporting the idea that 
the range and relative abundance of plant traits in ecosystems underlie 
the effects of species richness on ecosystem processes”””*. The effects of 
the presence of litter from particular plant functional types, or the inter- 
actions among these, were consistent across locations at both terrestrial 
and aquatic sites, together accounting for about 10% of the total vari- 
ance as shown in the full analysis of variance model (Table 1, P< 0.05; 
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Table 1 | Variance in C and N loss associated with diversity and sites 


Factor C loss N loss 

DF %SS F P %SS F P 
Variation associated with diversity 
Litter community: litter richness* 1 0 0) 0.93 1.0 63.7 aa 
Litter community: remaining variation associated with FT composition+ 13 10.1 66.0 ins 8.9 444 os 
Decomposer community (small, medium-sized or complete) 2 58 247 ae 44 142 one 
Litter community x decomposer community 28 0.8 2.45 =e 13 3.1 — 
Variation associated with sites 
Location (tropical, Mediterranean, temperate, boreal or subarctic) 4 12.1 52.8 “e% 16.9 66.5 ae 
Ecosystem (aquatic stream or terrestrial forest floor) 1 6.4 lis ony 14 22.3 7a 
Location xX ecosystem 4 6.3 27.7 ee 8.0 31.6 elk 
Block (within location x ecosystem) 40 23 4.87 da 25 4.13 = 
Variation associated with diversity or sites 
Decomposer community X location 8 8.6 92.1 eee 75 61.3 = 
Remaining variance 388 2h 5.95 eee 214 3.60 aes 
Total variance explained by the model 489 795 13.5 i i338 9.76 one 
Residuals 1,739 20.4 - - 26.7 - - 
Total 2,228 100.0 - - 100.0 - - 


The relative contributions of variance in C and N loss associated with diversity and sites (expressed in percentage sums of squares (% SS)) in a large-scale leaf litter decomposition experiment. The main factors are 
italicized. ***, P<0.001, analysis of variance based on sequential sums of squares (see Methods). See Extended Data Table 5 for details. DF, degrees of freedom; FT, functional type. 
* Plant species (trees or shrubs) were selected to represent the same four functional types at each location (N-fixing, evergreen, rapidly decomposing deciduous and slowly decomposing deciduous). Linear 


functional type richness was fitted before litter functional type compositions. 


+An alternative model omitting richness and testing in detail the litter functional type compositions in a full factorial analysis with contrasts for functional type presence/absence and interactions is presented in 
Extended Data Table 7. That model highlights the importance of the interaction between the litter of the N-fixing functional type and the rapidly decomposing functional type, hinting at a N-transfer mechanism. 


Extended Data Table 7). Beyond the presence or absence of particular 
plant functional types, we found no significant effect of the richness of 
plant functional types in the litter on C loss (P = 0.93), although a 
positive effect was observed on N loss (P < 0.001) (Table 1). The effect 
on N loss was strongest when the most complete decomposer com- 
munities had access to the litter (litter richness x decomposer com- 
munity interaction; P< 0.05). Our results indicate that partitioning 
the diversity effects into the separate contributions of the presence or 
absence of particular plant functional types in litter and their interactions 
can help move interpretations of biodiversity-ecosystem functioning 
experiments beyond the current dichotomy between broad generali- 
zations and claims of idiosyncratic compositional effects”'*"». 

An intriguing finding in this context is that the strongest positive in- 
teraction emerged between two particular litter functional types: N-fixing 
plants and rapidly decomposing deciduous plants (Extended Data Table 7). 
When these types were present together in litter mixtures, the average 
C loss was 13.5% greater than the average C loss of all litter combina- 
tions, and the N loss was 32.5% greater. This general pattern holds across 
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Figure 3 | Relative change in the total amount of litter N. The relative net 
difference between two-species mixtures (containing litter from the N-fixing 
and the rapidly decomposing plants) and monocultures of N-fixing plant litter 
(left) or rapidly decomposing plant litter (right) is shown (mean = s.e.m., 

n = 15; see Extended Data Table 8 for statistical analyses) for litter 
decomposing in terrestrial (brown) and aquatic (blue) ecosystems at five 
locations. The relative net difference was calculated as [(Nim — Nem)/Niml - 
[(Nija- Nea)/Neal, where N;,, and Nj, are the initial (i), and Nr, and Nr, are the 
final (f), amounts of N in a particular litter type in a mixture (m) or alone (a). 
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terrestrial and aquatic ecosystems from the subarctic to the tropics. More- 
over, relative to the total amount of N in the litter initially, less N re- 
mained in the litter of N-fixing plants when rapidly decomposing litter 
was present than when it decomposed alone (Fig. 3 and Extended Data 
Table 8). The rapidly decomposing litter, in turn, contained more N 
when litter from N-fixing plants was present than when it decomposed 
alone (Fig. 3 and Extended Data Table 8). On average across all of the 
sites, the litter of N-fixing plants lost 20.6% of its initial N when it 
decomposed alone but 25.0% when it decomposed in the presence of 
litter from rapidly decomposing plants. By contrast, the litter of rapidly 
decomposing plants lost 18.1% of its N when it decomposed alone but 
13.4% when litter from N-fixing plants was present. This striking pat- 
tern across locations and ecosystems suggests, for the first time from 
field data, that N can be transferred between litter types. A plausible 
mechanism for this effect is that fungal decomposers tap the nutrient 
reservoir of the N-fixing plant litter, boosting C use and fungal growth 
in the N-deficient litter, which provide high-quality C” (see section 
Extended discussion on litter N transfer in Methods). The average net 
differences in N fluxes between single-species litter and litter mixtures 
of these two plant functional types account for approximately 0.25 g N 
per square metre of ground area, representing up to one-tenth of the 
total annual N input from leaf litter fall. Thus, although the biodiver- 
sity effects that we report here, in line with recent syntheses’, are smaller 
than those noted for other ecosystem processes such as plant biomass 
production, these changes in N fluxes can have important consequences 
for the ecosystem. Even slight differences in the N dynamics in litter 
mixtures compared with the respective single-species litter can substan- 
tially change the N supply to primary producers and other organisms 
over large spatial and temporal scales”. 

The implications of our results are that changes in C and N cycling 
in response to biodiversity loss are largely predictable across vastly dif- 
ferent latitudes in both terrestrial and aquatic ecosystems, by taking into 
account relatively simple plant traits and the structural characteristics of 
decomposer communities. To provide robust projections of how eco- 
systems respond to a loss of biodiversity, it is essential to identify the 
mechanisms that result from specific interactions between the compo- 
nents of biodiversity as we describe here. With the consistent patterns 
and mechanisms of biodiversity effects that we have shown, such pro- 
jections now seem to be within reach. 
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METHODS SUMMARY 


The field experiments followed an identical protocol at ten sites, encompassing both 
aquatic (forest stream) and terrestrial (forest floor) ecosystems at five locations across 
a latitudinal gradient spanning from the subarctic to the tropics, with intermediate 
locations in boreal, temperate and Mediterranean climates (Extended Data Table 1). 
Leaf litter from native tree or shrub species representing four common functional 
types (evergreen, deciduous with slowly decomposing litter, deciduous with rapidly 
decomposing litter, and N-fixing) that naturally occur across all locations (18 species 
in total; Extended Data Table 2) was exposed to decomposers in a total of 2,250 exper- 
imental microcosms set up in the field with all possible location-specific single-species 
and multi-species combinations. We used a randomized block design with five blocks 
per site. Each block contained 1 replicate of 15 combinations of litter types (that is, 
all possible combinations of 4 litter species) X 3 microcosm mesh sizes (totalling 
45 microcosms per block). The three mesh sizes used to construct the field micro- 
cosms allowed us to establish three increasingly complete decomposer communities 
(small, medium-sized and complete) in the microcosms. Small-sized decomposer 
communities included microorganisms and fauna that passed through 50-um and 
250-Lum mesh screens (DIATEX) in terrestrial and aquatic ecosystems, respectively. 
The medium-sized decomposer communities contained all organisms (including 
invertebrates) that passed through 1-mm mesh screens, whereas the complete decom- 
poser communities included all organisms that passed through 5-mm mesh screens. 
Litter mass loss was allowed to proceed to the same defined decomposition stage 
(40-50% of the mass of the least recalcitrant litter type remaining at each location; 
Extended Data Table 9) to ensure the comparisons of C and N loss, as well as the 
effects of diversity, at a similar decomposition stage among sites, using analysis of 
variance models (see Methods section). 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


Experimental design. Our field experiments followed an identical protocol at a total 
of ten sites, representing either an aquatic ecosystem (forest stream) or a terrestrial 
ecosystem (forest floor). Five locations were selected across a broad latitudinal 
gradient spanning from the subarctic to the tropics, with intermediate locations in 
boreal, temperate and Mediterranean climates (Extended Data Table 1). Across all 
five locations, and in both the stream and forest ecosystems, the experiments con- 
sisted of a randomized block design in which the leaf litter from 4 common native 
plants (corresponding to the functional types shown in Extended Data Table 2) 
and 11 mixtures of these litter types (corresponding to all possible litter combina- 
tions within a location) were enclosed in nylon mesh screens (DIATEX) and placed 
in the field in five blocks (n = 5 locations X 2 ecosystem types X 15 litter combi- 
nations X 3 mesh sizes X 5 blocks = 2,250 microcosms). The four functional plant 
types represent distinct plant C allocation strategies (deciduous versus evergreen), 
N acquisition strategies (N fixer versus nota N fixer) and litter recalcitrance of decid- 
uous non-N-fixers (rapidly decomposing versus slowly decomposing). 

The three mesh sizes used to construct the microcosms enabled us to distinguish 

three increasingly complete decomposer communities (small, medium-sized and 
complete) that established themselves on the decomposing litter. Small decompo- 
sers included microorganisms and small-sized fauna that passed through 50-um 
and 250-|1m mesh screens in terrestrial and aquatic systems, respectively. The medium- 
sized decomposer communities contained all organisms (including invertebrates) 
that passed through 1-mm mesh screens, whereas the complete decomposer com- 
munities included all decomposers that passed through 5-mm mesh screens. Litter 
mass loss was allowed to proceed to the same defined decomposition stage (40-50% 
of the litter mass of the least recalcitrant litter type remaining at each site; Extended 
Data Table 9) to ensure meaningful comparisons of C and N loss among all sites. At 
all ten sites, extra microcosms containing the fastest decomposing litter type served 
as benchmark indicators of decomposition rates. 
Site characterization. The five stream locations were characterized in terms of their 
geomorphological, physical and chemical features (Extended Data Table 1). Water 
samples were collected for chemical analyses at the time of establishing the experi- 
ments. Samples for inorganic N and phosphorus determination were filtered over 
0.45-um pore-size cellulose acetate membrane filters and transported to the labor- 
atory in a cooler at about 5 °C, where they were frozen for later analysis at Eawag, 
Switzerland. 

At the five forest sites (Extended Data Table 1), the leaf area index was measured 
at breast height on a uniformly cloud-covered day when the forest canopy was fully 
developed, using an LAI-2000 Plant Canopy Analyzer (LI-COR) for the subarctic 
location, an LAI-2200 Plant Canopy Analyzer (LI-COR) for the Mediterranean and 
tropical locations, and a SunScan Canopy Analysis System (Delta T Devices) for the 
temperate locations. Data for the boreal location were provided by K. Bishop & P.-E. 
Mellander. To characterize the soil at each of the terrestrial sites, three samples from 
each experimental block were taken with a soil corer (5 cm diameter, 10 cm height), 
pooled, stored in plastic bags at 4 °C, then sent cooled to the University of Gottingen 
(Germany). Sieved soil samples (<2 mm sieve pore size) were analysed for pH (2 g 
soil in 20 ml 0.01 M CaCl) and C and N concentration (using an NA 1500 Carlo 
Erba elemental analyser). The microbial biomass of the soil was estimated using the 
substrate-induced respiration (SIR) method. The microbial respiratory response was 
measured in an electrolytic O,-microcompensation apparatus at 22 °C. These mea- 
surements were made hourly for 24 h. The microbial biomass was measured after the 
addition of glucose (8 mgCg * dry soil) as a substrate to saturate the catabolic 
activity of the microorganisms. The maximum initial respiratory response (MIRR; 
lO. g * dry mass h *) was calculated as the mean of the lowest three readings 
within the first 10 h, and the microbial biomass was calculated as Cynic = 38 X MIRR 
(in Hg Cynic eo soil dry mass). 

Data loggers (SL52T, Signatrol) were installed in some microcosms at all ten sites, 
to record the temperature every 2 h. These temperature measurements were taken 
in the same litter treatment for all three mesh sizes in three of the five experimental 
blocks. 

Leaf litter collection. A total of 20 litter types were collected at the 5 locations of 
our coordinated experiment. This litter corresponded to the same four functional 
types per location that were introduced above: N-fixing plants, rapidly decompos- 
ing deciduous plants, slowly decomposing deciduous plants and evergreen plants 
(Extended Data Table 2). Litter from these four functional types varies with respect 
to several quality traits*' (Extended Data Table 2). The selected species were com- 
mon native trees or, in two cases, native woody shrubs (Vaccinium vitis-idaea and 
Rhododendron tomentosum) occurring at each location. The litter was collected dur- 
ing location-specific leaf senescence either by hand (V. vitis-idaea and R. tomen- 
tosum) or by using litter traps. An exception was litter from the temperate evergreen 
species Ilex aquifolium, which was obtained by cutting branches in the field and sim- 
ulating senescence in the laboratory for three weeks. Leaves with signs of herbivory 


or disease were discarded. The litter from multiple individual trees or shrubs of 
each species was pooled and dried at 40 °C. 

Leaf litter field incubations. Stream experiments were conducted by exposing 5 g 
litter batches in tetrahedral mesh microcosms (17 cm X 25 cm) made of one of three 
mesh sizes (250 tm, 1 mm or 5 mm) to provide access to decomposer communities 
differing in body size. The microcosms were randomly attached (about 40 cm dis- 
tance between microcosms) to five 20-m metal chains, each ina separate riffle (short, 
relatively shallow sections of streams with non-stagnating water) 20 m or farther 
apart from each other (experimental blocks). The chains were fixed in the stream 
with reinforcing bars in fairly homogeneous sand-gravel stream sections, where 
leaves accumulated naturally. All microcosms were submerged at depths sufficient 
to ensure that they were not exposed to air when water levels dropped. Care was 
taken to expose the litter to constant flow conditions, avoiding deep depositional areas 
(that is, pools and backwaters) with slow or no flow and rocky riffles with broken 
flow. 

Terrestrial experiments on the forest floor were conducted by incubating 8 g 
location-specific litter (4 g only in the subarctic because of limited litter availability 
for some species) in field microcosms made of polyethylene cylinders (15 cm diam- 
eter, 10 cm height) covered with 50-11m mesh at the top and bottom to allow the 
passage of water but to prevent the entry of natural litter fall from above and the 
loss of small litter particles from the bottom. Two windows (5 cm X 18 cm) were 
cut into the cylinders and covered with 50-j1m, 1-mm or 5-mm mesh to provide 
access to decomposer communities differing in body size. The windows were cut 
close to the bottom of the cylinders to ensure that decomposers had access to a 
continuous layer of litter outside and inside the microcosms. An additional 1.5-cm 
height plastic ring of the same diameter as the cylindrical microcosms was attached 
at the bottom of the microcosm, making it possible to push the microcosms gently 
into the top soil (to a depth of 1.5 cm). This ring held the terrestrial microcosms 
properly in place while the bottom mesh was in intimate contact with the soil surface. 
In cases in which pushing the microcosms into the soil was difficult (for example, 
in the tropical forest with its dense superficial tree roots), the 1.5-cm rings were 
fitted with a separate plastic or metal ring before placing the microcosms. The micro- 
cosms were separated from each other by at least 50 cm. They were randomly dis- 
tributed within blocks that were established at least 20 m apart from each other. 
Sample harvest and processing. We removed the decomposing litter of all species 
from the field when 40-50% of the initial litter mass of the fastest decomposing species 
was remaining. As a consequence, the duration of litter decomposition varied among 
locations and ecosystem types (Extended Data Table 9). This procedure ensured that 
similar decomposition stages were sampled at all sites, facilitating meaningful com- 
parisons of decomposition rates and litter diversity effects. All litter samples were 
separated into the constituent species immediately after litter retrieval. The litter 
recovered from the streams was gently washed to remove any adhering material 
and invertebrates. The litter from the terrestrial sites was cleaned by gently brush- 
ing off any dirt without using water, to prevent the leaching of nutrients. The litter 
samples were then dried at 65 °C for 48 h. A correction factor was used to convert 
the initial litter mass (weighed after drying at 40 °C) to the final dry mass, based on 
ten randomly selected samples per litter type that were successively dried and weighed 
in the laboratory first at 40 °C and then at 65 °C. 

Litter C and N loss. The initial C and N concentrations of each of the 20 individual 
litter types were determined from 5 random samples. The final C and N concen- 
trations after retrieval of the litter from the field were also measured for each indi- 
vidually sorted litter type from each microcosm. This process resulted in a total of 
5,400 samples for which to calculate the percentage C and N loss for each litter type 
under the various conditions. Following the determination of litter dry mass, all 
initial and final samples were ground with a ball mill (Retsch PM 400) to a fine homo- 
geneous powder. Subsamples of 3 mg were analysed for C and N concentrations 
using a CHN elemental analyser (Flash EA 1112 Series, Thermo Finnigan). C and 
N loss (%) from the litter during field exposure was calculated as 100 X< [(M; x 
CN;) — (Mr X CN,)] / (M; X CN;), where M; and M, are the initial and final litter 
dry mass, respectively, and CN; and CN¢are the initial and final C or N concen- 
tration (% of litter dry mass). Using C loss (%) rather than total litter mass loss 
allowed us to correct for any possible inorganic contamination of the litter retrieved 
from the field. 

Analyses of diversity effects and statistical models. The net diversity effects, 
comprising complementarity and selection effects, on both C and N loss were cal- 
culated in species mixtures”’. The net diversity effect was calculated as the sum of 
the complementarity and selection effects and contrasts the actual C and N loss 
observed for mixtures of plant functional types with that expected based on the C 
and N loss measured in single-species treatments. The net diversity effect repre- 
sents the sum of synergistic or antagonistic interactions (that is, complementarity 
effects) and those due to the presence of a dominant species (that is, selection effects). 
Data were square-root transformed (keeping the original negative and positive signs 
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for the transformed values) to meet the assumptions for the analysis of variance of 
net diversity, complementarity and selection effects (see details below). 

Analysis of variance models based on sequential sums of squares (type I) were 
used to assess the effects of diversity (the richness of plant litter functional types or 
the presence or absence ofa given functional type and its interaction with other func- 
tional types), the completeness of the decomposer community (small, medium and 
large (complete)), the location across the latitudinal gradient and the ecosystem 
type (terrestrial versus aquatic) on percentage C and N loss. To ensure meaningful 
comparisons across the locations, several standardization methods were tested to 
remove any variation associated with the differences in incubation length. These 
methods included standardizing relative to the following: 1) a standard litter type 
from a non-native plant, Ailanthus altissima, that decomposed at all locations dur- 
ing the experiments; 2) the overall mean C or N loss per mesh size across locations; 
and 3) the mean C or N loss per mesh size of the rapidly decomposing functional 
type across locations. Because the results were consistent irrespective of standardi- 
zation, the final model is presented using the non-standardized data. 

The model terms were fitted to account for the dependency between the rich- 
ness of plant litter functional types and the functional type composition (the pres- 
ence or absence ofa given functional type and interactions between functional types). 
First, functional type composition was partitioned into a contrast for richness and 
residual functional type composition (Table 1 and Extended Data Table 5). Second, 
as shown in Extended Data Table 7, we omitted the richness term and instead re- 
solved the functional type composition into a full factorial analysis with contrasts 
for functional type presence or absence and interactions. In this model, the decom- 
poser community was fitted as a log-linear contrast (small to large mesh size was 
expressed as the logarithm of the mesh size of the microcosms, which produced a 
linear relationship of the three mesh sizes). We also removed all of the other non- 
significant interaction terms in multiple successive model-fitting steps. These two 
alternative analyses reflect different partitionings of the functional type composi- 
tion term into contrasts; they allowed us to compare the explanatory power of the 
richness contrast with the presence/absence contrast. A perfect linear richness effect 
would be found if all presence/absence contrasts had equal coefficients and did not 
interact. In this case, the mean squares or the richness effect with only one degree 
of freedom would be much larger than that of the combined mean squares of the 
presence/absence main effects of the four litter types with four degrees of freedom. 
In both models, the terms ‘location’ and ‘ecosystem type’ were tested at the block 
level. All other terms were tested against the residuals. 

A similar analysis of variance approach was used to test independently for the 
effects of these same factors on complementarity and selection effects, as well as on 
net diversity effects (Extended Data Table 3). In a separate analysis of variance (Ex- 
tended Data Table 8), we also tested whether the net loss of the total amount of N 
relative to the initial amount of N differed when litter of particular plant functional 
types (for example, rapidly decomposing litter and litter of N-fixing plants) decom- 
posed together as opposed to decomposing separately, which we interpreted as an 
indication of N transfer between litter species. The location and ecosystem type 
were also included in this analysis. All statistical analyses were performed with R 
software version 2.8.0. 
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Extended discussion on litter N transfer. Although our data suggest that N was 
transferred from the litter of N-fixing plants to rapidly decomposing litter, alterna- 
tive mechanisms cannot be entirely ruled out. In particular, N incorporated into 
decomposing litter can originate not only from another co-occurring litter type but 
also from the N pool in the soil or stream water or from microbial N fixation”. 
However, N transfer from such alternative N sources does not readily explain the 
concomitant reciprocal changes that we observed between the litter of N-fixing plants 
and rapidly decomposing litter. Moreover, the idea that N transfer occurred between 
the two litter types is further supported by a positive net diversity effect on C loss 
that we observed only when these two particular litter functional types were both 
present (Extended Data Table 3). Additional support for our interpretation comes 
from '°N tracer studies in microcosms with tropical’* and temperate* forest litter, 
which are proof of principle that active biological transfer of N through micro- 
organisms, particularly saprotrophic fungi, can occur. Our large-scale field experi- 
ment suggests that this phenomenon might be widespread across terrestrial and 
aquatic ecosystems and across a wide variety of forest types and climatic conditions. 

It had been proposed that N transfer is driven by a gradient in N concentration 
between litter types*®, the rationale being that the element that limits the decom- 
position rate is N. However, the scenario now unfolding from our experiment (Fig. 3) 
and the recent isotope tracer studies under laboratory conditions”*** is that N trans- 
fer is stoichiometrically controlled. The crucial determinant that defines the gra- 
dient along which N will be transferred in litter mixtures seems to be the demand 
for N relative to the availability of C (and possibly that of other elements critical for 
decomposer growth), rather than differences in the N concentration. A litter with 
high C quality favours rapid microbial growth, which in turn entails a high demand 
for N (and other nutrients), resulting in N acquisition from neighbouring nutrient 
pools. In extreme cases, the N source litter may even have a lower N concentration 
than the N sink litter”’, provided that the C quality of both litter types is sufficiently 
different. In accordance with this mechanism, the decomposition of recalcitrant 
litter types in our study (slowly decomposing and evergreen plant functional types) 
was not accelerated by the presence of litter from N-fixing plant species (Extended 
Data Table 7), although those recalcitrant litter types had similarly low or lower 
initial N concentrations than the rapidly decomposing litter species, which con- 
sistently benefited from the presence of N-fixing plant litter. 
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Extended Data Figure 1 | Net diversity, complementarity and selection 
effects of plant litter mixtures on N loss. The net diversity effect is the 
deviation from the expected mean based on N loss measured from litter 
consisting of single species. The blue and brown circles show the mean 
effects (+s.e.m.) on N loss from litter mixtures in forest streams and on forest 
floors, respectively, in subarctic (SUB), boreal (BOR), temperate (TEM), 
Mediterranean (MED) and tropical (TRO) locations. Each circle to the right 
of the dashed lines shows the mean effect per ecosystem type (that is, aquatic 
versus terrestrial), as calculated across the three types of decomposer 
communities (n = 165 litter mixtures per location and ecosystem type; see 
Extended Data Table 3 for statistical analyses). The circles to the left of the 
dashed lines show the overall mean across all locations (n = 825 litter 
mixtures per ecosystem type). 
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Extended Data Table 1 | Characteristics of aquatic and terrestrial ecosystems at five widely dispersed locations 


LETTER 


Aquatic Locations Subarctic Boreal Temperate Mediterranean Tropical 

Site Kopperasen Krycklan Mosbeek Maureillas Petit Saut 
Sweden Sweden The Netherlands France French Guiana 

Coordinates 68°26' N, 18°28'E 64°16' N, 19°50' E 52°26' N, 5°32'E 42°28' N, 2°48'E 5°04' N, 53°00' W 

Picea abies Quercus ilex, . 
‘ a Salix spp., . m ; ~140 different 

Dominant riparian tree cover : Betula pubescens Fagus sylvatica Corylus avellana, mc . 
Sorbus aucuparia Alnus incana Alnus glutinosa Panery specs 

Altitude (m a.s.l.) 445 200 37 180 35 

Catchment (km?) 11 19.2 4.9 18.5 0.6 

Distance to source (km) 1.6 4.0 1.3 8.8 1.0 

Stream order 1 3 2 3 2 

Slope (%) AZT. 2 0.7 2.3 2 

Mean channel width (m) 25 4.6 2.5 4.5 2.6 

Mean depth (m) 0.12 0.38 0.19 0.19 0.20 

MAT (°C) 0.9 1.8 10.2 15.4 25.4 

M Jan T (°C) -9.7 -10.0 3.6 8.3 24.9 

M July T (°C) 12.3 15.2 17.4 23.9 25.2 

MAP (mm) 352 643 814 766 2519 

Mean stream temperature 6.4 4.6 i 5.7 24.4 

during experiment (°C) 

Temperature range during 4 e 

experiment (°C) 0.6-14.2 0.1-9.9 §.5-9.3 1.9-11.3 23.5-25.4 

Conductivity (uS/cm) 122 19 233 190 19 

pH 7.3 F | 6.8 7.9 re 

Dissolved oxygen (mg/L) 9.3 11.0 11.3 11.4 n.d. 

Alkalinity (mmol/L) 0.78 0.55 0.33 1.33 0.23 

N-NO; (mg/L) 0.02 0.27 7.45 0.18 <0.02 

N-NH,. (ug/L) n.d. <20 154 20 <5 

SRP" (ug/L) <2 18 16 18 <1 

Terrestrial Locations Subarctic Boreal Temperate Mediterranean Tropical 

Site Abisko, Krycklan, Leuvenemse Bos, Barroubio, Paracou, 
Sweden Sweden The Netherlands France French Guiana 

Coordinates 68°21'N, 18°49'E 64°14' N, 19°50' E 52°18' N, 5°41'E 43°23' N, 2°51'E 5°18" N, 52°55' W 

Quercus ilex, : 
‘ Betula pubescens, ‘ - ” ~140 different 
Dominant tree cover Betula pubescens Picea abies Fagus sylvatica Fraxinus : canopy species 
angustifolia 
: 1.71 + 0.15 (tree) t 

Leaf area index (LAI) 3.28 + 0.29 (shrub) 6.5 3.85 + 0.12 3.66 + 0.02 5.72 + 0.25 

MAT (°C) 0.9 1.8 10.7 14.6 25.4 

M Jan T (°C) -9.7 -10.0 4.3 7.4 24.9 

M July T (°C) 12.3 15.2 18.0 23.3 25.2 

MAP (mm) 352 643 911 670 2519 

Soil pH 4.63 + 0.27 4.70 + 0.12 3.29 + 0.06 6.56 + 0.23 4.80 + 0.04 

Soil Cmic (ug g')® 4602 + 2870 1266 + 589 2937 + 820 439 + 87 648 + 37 

Soil C (%)° 27.36 + 0.60 3.63 + 0.06 25.21 + 1.03 4.57 + 0.72 2.84 + 0.30 

Soil N (%)° 0.95 + 0.01 0.19 + 0.00 1.14 40.03 0.39 + 0.04 0.19 + 0.01 


* Means were calculated based on 10-year records between 1998 and 2008 from the closest possible meteorological station. 
+ Soluble reactive phosphorus ~ ortho-phosphate. 

{Data courtesy of K. Bishop and P.-E. Mellander. 

§ Soil microbial biomass (C,,i,), Soil C and soil N are expressed on a dry mass basis. 
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Extended Data Table 2 


Location 


Subarctic 
Boreal 
Temperate 
Mediterranean 


Tropical 


Quality trait” 

N 

Lignin 

Water saturation 


Condensed tannins 


N-fixing 


Alnus incana (L.) 
Moench 


Alnus incana (L.) 
Moench 


Alnus glutinosa (L.) 
Gaertn. 


Alnus glutinosa (L.) 
Gaertn. 


Diplotropis purpurea 
(Rich.) Amshoff 


2.15 + 0.28 


17.24 3.6 
238 + 37 


1.48 + 0.84 


Plant Functional Type 
Deciduous, Rapidly Deciduous, 
decomposing Slow- decomposing 
Sorbus aucuparia L. Populus tremula L. 
Prunus padus L. Betula pubescens 
Ehrh. 
Salix cinerea L. Fagus sylvatica L. 
Fraxinus angustifolia Vahl _—_ Pistacia terebinthus L. 
Qualea rosea Aubl. Vochysia densiflora 
Spruce ex Warm 
0.97 + 0.11 0.73 + 0.10 
12.0+2.9 16.5+42.5 
332 + 182 222 + 114 
1.76 + 0.54 2.68 + 0.44 


Plant functional types, species identity and litter quality traits 
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Evergreen 

Vaccinium vitis-idaea L. 
Rhododendron 
tomentosum Harmaja 
Ilex aquifolium L. 


Quercus ilex L. 


Eperua falcata Aubl. 


1.05 + 0.18 


22.44 4.1 
123 + 28 


1.58 + 0.35 


Leaf litter was sampled from location-specific native tree species corresponding to four functional types (top) varying in quality traits associated with decomposition (bottom). 
* All data are shown as percentage dry mass (mean + s.e.m., n = 5). Methods are described in detail in ref. 31. 
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Extended Data Table 3 | Results of analyses of variance testing for the net diversity effect (NE), complementarity effect (CE) and selection 
effect (SE) on C loss (top) and N loss (bottom) from decomposing leaf litter* 


Net diversity Complementarity Selection 
Source of variation 
DFye Fue Pye DFce Fee Pee DFse Fee Pse 


Effect on C loss 


N-fixer (NF) 1 3.53 0.06 4 0.64 0.42 4 0.06 0.80 
Rapid decomposer (RD) 1 0.41 0.52 4 0.51 0.48 4 0.05 0.82 
Slow decomposer (SD) a 0.28 0.59 1 0.17 0.68 4 0.11 0.75 
Evergreen (EG) 1 0.02 0.90 1 0.61 0.43 4 5.60 <0.05 
NF x RD 1 5:25 <0.05 4 2.35 0.13 4 7.95 <0.01 
RD x EG 1 0.63 0.43 4 0.00 0.99 4 7.64 <0.01 
Decomposer comm. (DC) 2 0.40 0.67 2 1.11 0.33 2 3.29 <0.05 
NF x DC 2 4.40 <0.05 2 1.99 0.14 2 2.60 0.07 
RD x DC 2 5.09 <0.01 2 3.09 <0.05 2 3.59 <0.05 
SD x DC 2 5.57 <0.01 2 4.43 <0.05 2 0.69 0.50 
NF x SD x DC 3 0.41 0.74 3 0.28 0.83 3 5.08 <0.01 
Location’ 4 11.68  <0.001 4 11.08  <0.001 4 52.60 <0.001 
NF x Location 4 1.98 0.10 4 4.12 <0.01 4 5.41. <0.001 
RD x Location 4 3.58 <0.01 4 2:55 <0.05 4 4.11 <0.01 
SD x Location 4 0.52 0.72 4 2.25 0.06 4 11.46  <0.001 
EG x Location 4 1.51 0.20 4 2.17 0.07 4 13.63. <0.001 
NF x RD x Location 4 1.11 0.35 4 2.15 0.07 4 3.02 <0.05 
RD x EG ~ Location 4 173 0.14 4 1.67 0.15 4 3.08 <0.05 
DC x Location 8 493  <0.001 8 4.36  <0.001 8 7.91 <0.001 
NF x DC x Location 8 4.16  <0.001 8 2.28 <0.05 8 2.36 <0.05 
RD x DC x Location 8 2.26 <0.05 8 1.54 0.14 8 1.57 0.13 
SD x DC ~ Location 8 0.74 0.65 8 1.71 0.09 8 443  <0.001 
NF x SD x DC x Location 12 0.91 0.53 12 1.12 0.34 12 3.26 <0.001 
RD x SD x DC x Location 15 0.79 0.69 15 0.83 0.65 15 1.97 <0.05 
Ecosystem type (ET)! 1 10.9  <0.001 4 15.8  <0.001 4 5.23 <0.05 
RD x ET 1 0.45 0.50 4 0.57 0.45 4 22.17 <0.001 
SD x ET 4 oe | <0.05 4 1.99 0.16 4 5.42 <0.05 
EG x ET 1 1.70 0.19 4 5.17 <0.05 4 9.74 <0.01 
RD x SD x ET 4 0.89 0.35 4 1.52 0.22 4 5.42 <0.05 
ET x Location’ 4 11.12 <0.001 4 15.29 <0.001 4 8.58  <0.001 
DC x ET 2 1.18 0.31 2 1.92 0.15 2 4.59 <0.05 
DC x Location x ET 8 4.02 <0.001 8 4.30 <0.001 8 5.81 <0.001 
NF x DC x Location x ET 15 2.09 <0.01 15 1.53 0.09 15 0.98 0.46 
RD x DC x Location x ET 14 1.36 0.16 14 1.30 0.20 14 1.90 <0.05 
Block 40 5.43 <0.001 40 5.38  <0.001 40 2.10  <0.001 
Residuals 1437 1437 1437 
Effect on N loss 

N-fixer (NF) 1 0.63 0.43 4 0.02 0.88 4 0.22 0.63 
Rapid decomposer (RD) 1 2.97 0.09 1 0.59 0.44 1 3.52 0.06 
Slow decomposer (SD) 1 12.62  <0.001 4 7.93 <0.01 1 16.20 <0.001 
Evergreen (EG) 1 0.04 0.83 4 1.44 0.23 4 1.27 0.26 
SD x EG 1 0.20 0.65 4 8.67 <0.01 4 3.78 <0.05 
Decomposer comm. (DC) 2 4:22 0.30 2 5.75 <0.01 2 3.39 <0.05 
NF x DC 2 1.30 0.27 2 4.27 <0.05 2 0.76 0.47 
RD x DC 2 4.23 <0.05 2 9.06 <0.001 2 5.90 <0.01 
SD x DC 2 25.25 <0.001 Z 11.72 <0.001 2 23.30 <0.001 
Location’ 4 6.28  <0.001 4 5.32  <0.001 4 7.01 <0.001 
NF x Location 4 5.33  <0.001 4 0.68 0.61 4 1.93 0.10 
SD x Location 4 5.87 <0.001 4 2.56 <0.01 4 5.36  <0.001 
EG ~ Location 4 3.80 <0.01 4 1.84 0.12 4 3.22 <0.05 
DC x Location 8 11.30 <0.001 8 14.03 <0.001 8 15.08  <0.001 
NF x DC x Location 8 3.18 <0.01 8 3.18 <0.01 8 2.44 <0.05 
RD x DC ~x Location 12 3.48  <0.001 12 3.68  <0.001 12 3.92 <0.001 
SD x DC x Location 8 2.56 <0.01 8 6.93 <0.001 8 6.68  <0.001 
Ecosystem type (ET)' 1 61.29 <0.001 4 49.46 <0.001 4 82.66  <0.001 
ET x Location’ 4 16.42 <0.001 4 19.68  <0.001 4 20.08  <0.001 
NF x ET 1 10.46 <0.01 4 1.31 0.25 4 9.47 <0.01 
NF x Location x ET 4 8.33  <0.001 4 4.12 <0.01 4 6.93 <0.001 
NF x RD x ET 2 6.37 <0.01 2 2.10 0.12 2 3.89 <0.05 
SD x ET 1 4.54 <0.05 4 30.09  <0.001 4 15.92 <0.001 
SD x Location x ET 4 10.69 <0.001 4 17.78 <0.001 4 17.21 <0.001 
NF x SD x Location x ET 10 2.19 <0.05 10 5.03 <0.001 10 5.21 <0.001 
EG ~ Location x ET 5 215 <0.05 5 3.42 <0.01 5 2.40 <0.05 
DC x ET 2 15.11 <0.001 2 21.04  <0.001 2 19.83  <0.001 
RD x DC x ET 2 0.17 0.85 2 7.70  <0.001 2 3.63 <0.05 
DC x Location x ET 8 5.96  <0.001 8 7.90 <0.001 8 6.70 <0.001 
NF x DC x Location x ET 10 2.81 <0.01 10 1.59 0.11 10 1.01 0.43 
SD x DC x ET 2 5.26 <0.01 2 8.19  <0.001 2 9.11 <0.001 
SD x DC x Location x ET 8 4.17  <0.001 8 4.52  <0.001 8 5.78  <0.001 
Block 40 2.55 <0.001 40 2.44  <0.001 40 1.80 <0.01 
Residuals 1460 1460 1460 


*|nteraction terms omitted from the final model are not significant for any of the three response variables. 
+ Location, ecosystem type and their interaction were tested against the block rather than against the residual. 
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Extended Data Table 4 | Characteristics of stream macroinvertebrate communities at the five tested locations* 


Location 


Subarctic 


Boreal 


Temperate 


Mediterranean* 


Tropical 


*The mean density and 


Detritivores Detritivore community composition’ Predators 
Density Taxon Plecoptera Trichoptera Gammarus Density Taxon ‘ 
@m?) fickiness (%) (%) (%) Dominant taxa (m?) ered Dominant taxa 
46.2 5 95.7 4.3 Leuctra digitata 40.0 7 aes, 
+128 +4.8 £48 . Capnopsis schilleri $20.4 Rhyacophite, Diura 
nanseni 
Protonemura meyeri, 
i 5 i Leuctra digitata A acophila nubila 
Gammarus pulex, Dugesia 
332.3 13 4.3 10.2 82.3 Chaetopteryx villosa, 130.7 13 gonocephala, 
+62.6 +5.8 +16.1 413.7 Sericostoma 147.4 Plectrocnemia 
personatum conspersa 
Leuctridae, Perlodidae, 
432.1 10 62.9 36.1 1.0 Taeniopterygidae, 418.3 11 Athericidae, 
+330.6 +26.2 +26.6 2.2 Nemouridae, +239.1 Limoniidae, 
Limnephilidae Rhyacophilidae 
8.6 3 7 21.4 . Phylloicus, 12.3 4 Perlidae, 
14.7 +39.3 Pyralidae, Tipulidae +22.8 Calopterygidae 


‘otal taxon richness of detritivores, their main invertebrate predators, and the mean proportion of Plecoptera, Trichoptera and Gammarus as a percentage of total detritivore abundance 


(mean = s.d.) are shown. All samples were collected using a 500-11m mesh net at the same time of year as the main experiment (although in different years in some cases). Specific sampling protocols differed 


0.3m x 5m, in October 


tween locations, with the density standardized to the number of individuals per metre squared. For the subarctic site, six replicate kick samples were taken, each from an area of 1 m X 0.35 m for 1 min, during 
September 2006. Identification was mostly to the species level®*. For the boreal site, four replicate Surber samples per year were taken for three years, during October 2010-2012, with a quadrat size of 
0.25 m x 0.5 m. Identification was mostly to the species level (B.G.M. and P.-O. Hoffsten, unpublished observations). For the temperate site, five replicate sweep net samples were taken, each from an area of 


992. Identification was mostly to the species level®°. For the Mediterranean site, five replicate Surber samples were taken, with a quadrat size of 0.33 m x 0.31 mm, in January 2014. 


dentification was mostly to the family level (E.C. and S. Lamothe, unpublished observations). For the tropical site, ten replicate natural leaf packs (fist-sized handfuls of leaves picked from the stream bed) were 
taken from each of seven streams in May 2007. Abundances per leaf pack were converted to densities based on standardized visual estimates of stream-bed litter cover. Identification was mostly to the family level 
(A.B., M. Schindler, M. S. Moretti and M.O.G., unpublished observations). 
Detritivore community composition data do not sum to 100% at all locations, owing to the presence of other dipteran (Tipulidae), lepidopteran (Pyralidae) and crustacean (Asellidae) shredders at the temperate 
site, and tipulid and pyralid shredders at the tropical site. 
The caddisfly Micrasema (Brachycentridae) was common at the Mediterranean site but was small and was not regarded as a shredder. 
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Extended Data Table 5 | Full model output of the relative contributions of variance associated with diversity and sites to explain C and N loss 


Source of variation DF C loss N loss 
%SS F P %SS F P 
Variation associated with diversity 
Litter community 
Litter richness’ 1 ) 0.01 0.93 1.0 63.7 se 
Litter diversity’ 2 0.1 2.61 0.07 0.3 8.29 vei 
Variation associated with 11 10.0 77.5 “ 8.6 50.9 sisi 
functional type (FT) composition* 
Decomposer community (DC) 2 5.8 247 ala 44 142 nen 
Litter community x DC 
Richness x DC 2 0 0.37 0.69 0.1 3.45 . 
Diversity x DC 4 0 0.70 0.59 0.0 0.07 0.99 
FT composition x DC 22 0.8 2.96 — 12 3.61 ad 


Variation associated with sites 


Location 4 124 52.8 — 16.9 66.5 = 
Richness x Location 4 0.1 2.07 0.08 0.3 5.06 one 
Diversity x Location 8 0.2 2.35 . 0.3 2.37 . 
FT compositions x Location 44 15.8 30.6 waial 10.4 15.4 cial 
DC x Location 8 8.6 92.1 nee ye] 61.3 one 
Richness x DC x Location 8 0 0.51 0.85 0.1 0.53 0.83 
Diversity x DC x Location 16 0.3 1.36 0.15 0.2 0.84 0.64 
FT composition x DC x Location 88 1.6 1.56 ial 17 1.28 * 
Ecosystem 4 6.4 113 on 1.4 22.3 wee 
Location x Ecosystem 4 6.3 27.7 aed 8.0 31.6 acu 
Richness x Ecosystem 4 0.1 5.79 * 0 1.02 0.31 
Richness x Location x Ecosystem 4 0.1 1.25 0.29 0.2 3.21 * 
Diversity x Ecosystem 2 0 0.62 0.54 0 1.27 0.28 
Diversity x Location x Ecosystem 8 0.2 2.37 * 0.1 1.11 0.35 
FT Composition x Ecosystem 11 1.0 7.94 = 1.4 8.30 = 
FT Composition x Location x Ecosystem 44 AT 3.30 —_ 1.9 2.81 — 
DC x Ecosystem 2 1.9 79.7 aided 0.9 27.7 = 
DC x Location x Ecosystem 8 2.4 25.7 = 1.9 15.9 — 
Richness x DC x Ecosystem 2 0 0.36 0.70 0 0.60 0.55 
Richness x DC x Location x Ecosystem 8 0.1 0.70 0.69 0 0.33 0.96 
Diversity x DC x Ecosystem 4 0 0.35 0.84 0.1 1.40 0.23 
Diversity x DC x Location x Ecosystem 16 0.1 0.38 0.99 0.2 0.79 0.70 
FT Composition x DC x Ecosystem 22 0.3 1.14 0.30 0.4 1.08 0.36 
FT Composition x DC x Location x Ecosystem 88 1.2 TAT 0.13 12 0.92 0.69 
Block (within location x ecosystem) 40 2.3 4.87 ial 2.5 4.13 — 
Total variance explained by the model 489 79.6 13.9 oe 73.3 9.76 — 
Residuals 1739 20.4 26.7 


Total 2228 100.0 100.0 


Variance associated with diversity and sites is expressed in percentage sums of squares (% SS); levels of significance are *P < 0.05, **P<0.01, ***P<0.001. 

+ Plant species were selected to represent the same four functional types (FTs) at each location (N-fixing, evergreen, rapidly decomposing deciduous trees or shrubs, or slowly decomposing deciduous trees or 
shrubs). Litter FT richness (linear contrast) and litter diversity (factorial contrast) were fitted before litter FT compositions. 

$Analternative model omitting richness and testing in detail the litter FT compositions in a full factorial analysis with contrasts for FT presence/absence and interactions is presented in Extended Data Table 7. That 
model highlights the importance of the interaction between the litter of the N-fixing FT and the rapidly decomposing FT, hinting at a N-transfer mechanism. 
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Extended Data Table 6 | Characteristics of soil fauna communities at the five tested locations* 


Location Mesofauna’ 
Microbi-detritivores Dominant taxa* Predators Dominant taxa* 
Density Taxon Collembola : Density Taxon 
(#m”) richness’ (%) Aeeri'(%) (#m°) richness’ 
Subarctic 4068 46.9 53.1 506 , 
$2254 5 413.0 +13.0 +209 ; clase 
Boreal 9824 61.9 38.1 690 . 
£3750 ba $14.9 +14.9 +356 : Ceneaea 
Temperate 17020 ‘a 75.0 25.0 711 ‘ Gamasida (73%) 
+6846 +8.3 $8.3 £347 Diplura (27%) 
Mediterranean 9222 10 21.9 78.1 1386 2 Gamasida (64%) 
+6514 +6.9 +6.9 +598 Diplura (36%) 
Tropical 641 6 3.9 96.0 276 3 Gamasida (47%) 
+134 23,7 43.7 +106 Diplura (53%) 
Location Macrofauna’ 
Detritivores Dominant taxa* Predators Dominant taxa* 
Density Taxon Density Taxon 
(# m7”) richness’ (#m*) richness’ 
Subarctic 595 10 Diptera (68%) 588 9 Araneidae (41%) 
+249 Gastropoda (25%) +197 Diptera (35%) 
Boreal 333 g Diptera (82%) 1824 " Diptera (73%) 
+185 Annelida (13%) +1674 Coleoptera (20%) 
Temperate 81 Diplopoda (60%) 3640 il 
+59 6 Gastropoda (23%) $1534 6 Chllopoda (20%) 
Mediterranean 501 10 lsopoda (43%) 2244 16 Chilopoda (51%) 
+219 Diptera (38%) +2021 Hymenoptera (30%) 
Tropical 1750 10 lsoptera (34%) 1813 13 Hymenoptera (59%) 
+3115 Diptera (26%) #1515 Pseudoscorpionida (32%) 


*The mean density, the total taxon richness and the proportion of dominant taxa as a percentage of total community abundance are shown (mean = s.d.). Communities are divided into mesofauna and 
macrofauna, reflecting an increase in body size that relates to mesh size differences in the field microcosms. 

+ Taxon richness is based on the number of observed families. 

+ Dominant taxa data are based on a lower taxonomic resolution than taxon richness, mainly order or class level. The community composition data do not always sum to 100% at all locations, owing to the presence 
of other taxa. 

§ All samples were collected at the end of the growing season in 2008 (subarctic and boreal, late September; temperate and Mediterranean, October; and tropical, early December). Eight Kempson cores (21-cm 
diameter) and eight MacFayden cores (5-cm diameter) were taken at each field site. The reported data are based on extraction of the whole soil core (9-cm height), including the litter layer. Soil arthropods were 
extracted, counted and identified to the highest possible taxonomic level (families) (O.B. and S.S., unpublished observations). 
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Extended Data Table 7 | Analysis of variance testing for effects on total litter C loss (top) and N loss (bottom) 


Source of variation DF ss F % Sum of 

Squares 

% C Loss 
Rapid decomposer (RD) 1 16580 290.7 3.9 
Slow decomposer (SD) 1 12068 211.6 29 
N-fixer (NF) 1 3146 §5.2 0.7 
Evergreen (EG) 1 3576 62.7 0.9 
RD x NF 1 3532 61.9 0.8 
RD x EG 1 747 13.1 0.2 
SD x EG 1 1242 21.8 0.3 
RD x SD : 1 223 3.9 0.1 
Decomposer community 1 23404 410.3 5.6 
Decomposer community x NF 1 907 15.9 0.2 
Decomposer community x SD 1 526 9.2 0.1 
Location’ 4 50698 51.5 12.1 
Location x EG 4 26247 115.0 6.2 
Location x RD 4 13398 58.7 3.2 
Location x SD 4 7784 34.1 1.9 
Location x NF 4 8818 38.7 21 
Location x RD x EG 4 1272 5.6 0.3 
Location x RD x SD 4 1686 74 0.4 
Location x SD x EG 4 976 43 0.2 
Location x Decomposer community 4 31306 137.2 7.5 
Location x Decomposer community x SD 4 1144 5.0 0.3 
Ecosystem type (ET)' 1 27126 110.3 6.5 
ET x Location’ 4 26703 27.1 6.4 
ET x Location x RD 5 1330 47 0.3 
ET x Location x SD 5 1860 6.5 0.4 
ET x NF 1 446 7.8 0.1 
ET x Location x NF 4 554 24 0.1 
ETxEG 1 3450 60.5 0.8 
ET x Location x EG 4 2318 10.2 0.6 
ETxRDxSD 1 262 46 0.1 
ET x Decomposer community 1 7761 136.1 1.8 
ET x Location x Decomposer community 4 8908 39.0 aA 
Block 40 9836 43 2.3 
Residuals 2106 120133 28.6 
% N Loss 

Rapid decomposer (RD) 1 16570 87.7 1.4 
Slow decomposer (SD) 1 31927 169.0 reg 
N-fixer (NF) 1 40390 213.7 3.5 
Evergreen (EG) 1 3021 16.0 0.3 
RD * NF 1 13245 70.1 14 
RD x EG 1 2697 14.3 0.2 
SD x NF 1 971 5.1 0.1 
NF x EG 1 2438 12.9 0.2 
RD x NF x EG 1 2947 15.6 0.3 
Decomposer community" 1 48999 259.3 4.2 
Decomposer community x NF 1 5281 28.0 0.5 
Decomposer community x SD 1 1080 5.7 0.1 
Decomposer community x EG 1 771 44 0.1 
Decomposer community x NF x SD 1 1071 Le 0.1 
Decomposer community x NF x EG 1 2051 10.9 0.2 
Location’ 4 195881 66.7 16.8 
Location x Decomposer community 4 75781 100.3 6.5 
Location x NF 6 46245 61.2 4.0 
Location x RD 4 30727 40.7 2.6 
Location x EG 4 20789 27.5 1.8 
Location x SD 4 4815 6.4 04 
Location x NF x EG 4 10603 14.0 0.9 
Location x RD x EG 4 3757 5.0 0.3 
Location x NF x RD 4 3346 44 0.3 
Location x Decomposer community x EG a 2632 3.5 0.2 
Ecosystem type (ET)' 1 16565 22.6 14 
ET x Location’ 4 92914 31.7 8.0 
ETxRD 1 5845 30.9 0.5 
ET x NF 1 1207 6.4 0.1 
ET x Location x NF 4 9700 12.8 0.8 
ETxEG 1 4173 22.1 0.4 
ET x Location x EG 4 3870 5.1 0.3 
ET xNFxRD 1 2554 13.5 0.2 
ET x Location x RD x NF 4 1908 25 0.2 
ET xNF x EG 1 1419 75 0.1 
ET x Decomposer community 1 9015 47.7 0.8 
ET x Location x Decomposer community 4 17968 23.8 i 
Block 40 29349 3.9 2.5 
Residuals 2106 397977 34.2 


All terms included in the final model shown are significant at P< 0.05. 
*The decomposer community was fitted as a log-linear contrast and not a factorial contrast (a factorial contrast is shown in Table 1). 
+The location and ecosystem type and their interaction were tested against the block rather than against the residual. 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


Extended Data Table 8 | Analysis of variance testing for the proportional change in total litter N content 


Source of variation DF ss F P 
Mixture (decomposing alone or in presence of the other functional type) 1 0.002 0.057 0.81 
Functional type (N-fixing or rapidly decomposing) 1 0.650 15.8 <0.001 
Mixture x Functional type 1 0.290 7.05 <0.01 
Location’ = 2.053 11.9 <0.05 
Ecosystem type (ET)' 1 0.818 18.9 <0.05 
ET x Location! 4 2.277 13.1 <0.05 
Block 4 0.173 1.05 0.38 
Residuals 574 23.61 


The test compares specific two-species combinations, including the particular functional types of N-fixing and rapidly decomposing plants, to their respective single-species treatment across location and 
ecosystem type. 

* A significant difference in the mixture x functional type interaction is taken as an indication of N transfer between litter species. 

+ Location and ecosystem type and their interaction were also included as factors in this analysis and were tested against the block rather than the residuals. 
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Extended Data Table 9 | Experimental duration and richness of naturally occurring local litter species in terrestrial and aquatic ecosystems at 
each of five widely dispersed locations 


Ecosystem Location apn eer — iter species 
richness 
Terrestrial Subarctic 22/ 09/ 06 5/ 09/ 08 714 6 
Boreal 20/ 10/ 06 12/ 05/ 08 570 4 
Temperate 17/01/07 5/ 03/ 08 414 4 
Mediterranean 15/ 11/07 19/ 10/ 08 338 4 
Tropical 20/ 11/ 06 25/ 08/ 07 278 7 
Aquatic Subarctic 27/ 07/ 07 11/10/07 76 6 
Boreal 11/ 09/ 07 13/11/07 63 4 
Temperate 18/ 10/07 13/12/07 56 4 
Mediterranean 16/ 11/07 3/ 01/ 08 48 11 
Tropical 9/ 05/ 07 14/ 08/ 07 97 10 


* Incubation dates differed across ecosystem types and locations to ensure that, at the time of sampling, 40-50% of the mass of the most rapidly decomposing litter remained, thus allowing comparisons at similar 
decomposition stages. 


+The mean species richness counts of naturally occurring litter in five randomly sampled plots that were the size of microcosms (15-cm diameter) in each of the five experimental blocks. 
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Niche filling slows the diversification of 


Himalayan songbirds 


Trevor D. Price’, Daniel M. Hooper’, Caitlyn D. Buchanan!, UlfS. Johansson’, D. Thomas Tietze’, Per Alstrém*°, Urban Olsson®, 
Mousumi Ghosh-Harihar’, Farah Ishtiaq’, Sandeep K. Gupta’, Jochen Martens®, Bettina Harr’, Pratap Singh’ & Dhananjai Mohan’ 


Speciation generally involves a three-step process—range expansion, 
range fragmentation and the development of reproductive isola- 
tion between spatially separated populations’”. Speciation relies on 
cycling through these three steps and each may limit the rate at which 
new species form’*. We estimate phylogenetic relationships among 
all Himalayan songbirds to ask whether the development of repro- 
ductive isolation and ecological competition, both factors that limit 
range expansions’, set an ultimate limit on speciation. Based on a 
phylogeny for all 358 species distributed along the eastern elevational 
gradient, here we show that body size and shape differences evolved 
early in the radiation, with the elevational band occupied by a species 
evolving later. These results are consistent with competition for niche 
space limiting species accumulation’. Even the elevation dimension 
seems to be approaching ecological saturation, because the closest 
relatives both inside the assemblage and elsewhere in the Himalayas 
are on average separated by more than five million years, which is lon- 
ger than it generally takes for reproductive isolation to be completed”**; 
also, elevational distributions are well explained by resource avail- 
ability, notably the abundance of arthropods, and not by differences 
in diversification rates in different elevational zones. Our results imply 
that speciation rate is ultimately set by niche filling (that is, ecological 
competition for resources), rather than by the rate of acquisition of 
reproductive isolation. 

Range expansions are a critical step in the speciation cycle: without 
them, allopatric and parapatric forms would have ranges of ever decreas- 
ing size, unlikely to be further fragmented by barriers’. The expansion 
of geographical range by one taxon inevitably brings it into sympatry 
with related taxa, which requires reproductive isolation between the 
forms’ and is generally thought to require ecological differences as 
well’”. In young adaptive radiations reproductive isolation and ecological 
divergence may be coupled'*", with consequent rapid cycling through 
the speciation cycle’ (Extended Data Fig. 1). For example, young spe- 
cies of Darwin’s ground finches (Geospiza) differ in beak and body size 
and coexist in sympatry by exploiting different-sized seeds. These beak 
and body size differences contribute to reproductive isolation, because 
they are used as cues in conspecific mate choice (premating isolation) 
and because intermediate-sized hybrids are at a disadvantage in some 
environments (postmating isolation)’. As adaptive radiations proceed 
and environments fill with species, rates of ecological divergence slow'””” 
and concomitantly the rate of evolution of reproductive isolation declines. 
Evidence from bird hybrid zones on continents implies that species may 
continue to interbreed even when separated by more than two million 
years”’, preventing mutual range expansions‘ and delaying the specia- 
tion cycle’. 

Here, we introduce a method of studying the causes of slowing speci- 
ation rates as species accumulate by considering all species in a sympatric 
continental community. We ask whether limits to range expansions into 
the community are attributable to an absence of reproductive isolation 


or to ecological competition, by considering the age and pattern of diver- 
sification along different ecological dimensions, and by directly eval- 
uating species distributions through field measurements of resources. 
To do this, we built a molecular-based phylogeny for all 461 Himalayan 
songbirds (Fig. 1). The songbirds (or oscines) are one of three suborders 
in the order Passeriformes, which is one of the 39 bird orders, but this 
one suborder contains more than 5,000 species, or over 45% of all the 
world’s birds. Worldwide, they reach their maximum diversity in the east- 
ern Himalayas, where about 358 species breed within our 10,000-km? 
study area (Fig. 1). We consider these species to be sympatric because 
they are within “cruising range” of one another®. They occupy a diver- 
sity of climates, from near-tropical to near-boreal, their body mass varies 
>200-fold (4.5 g to 950 g), their shape differences are extreme (for exam- 
ple, among species weighing about 30 g, beak length varies fivefold) and 
they include specialist nectarivores, frugivores, granivores, insectivores 
and aerial foragers, that is, they bear the hallmarks of an adaptive radi- 
ation. We assume that ecological differences are essential to coexistence. 
Previous work on one subgroup of similar species, the Old World leaf 
warblers (Phylloscopidae, 19 Himalayan species’*), demonstrated the 
importance of ecological differences, with sympatric species differing along 
at least one dimension of feeding method, body size and elevation'*”’. 

Biogeographical reconstructions relate many of the speciation events 
that led to the 358 species in our study area to barrier formation outside 
the Himalayas, including between the Indonesian islands’ and within 
China’’. Thus, the east Himalayan community has been assembled largely 
by immigration from outside, with ecological differences either gener- 
ated in situ, or before invasion. We emphasize relationships within a 
group of sympatric species, rather than within the entire songbird clade, 
for two reasons. First, many of the other species in the songbird clade (for 
example, Darwin’s finches) have originated in response to ecological con- 
ditions elsewhere, so including them would require a complex integration 
of the timing of establishment and patterns of ecological diversification 
across the world. Second, many closely related allopatric species appar- 
ently occupy the same ecological niche in different places. For example, 
Packert et al.'” studied 26 sister species pairs which contained at least 
one species in our study area; they found that in most cases (63%) the 
other sister was a similar, allopatric, replacement (see also Extended Data 
Fig. 2). A collection of closely allopatric replacements that occupy the 
same niche in different places form a superspecies’. Phylogenetic rela- 
tionships among sympatric species thus capture diversification patterns 
among superspecies and hence lineage splits that can potentially be lim- 
ited by niche filling (Extended Data Figs 2 and 3). 

Previous phylogenetic analyses of two subgroups of Himalayan song- 
birds, the Old World leaf warblers’* and separately, the Corvoidea’® 
(57 species) found that differences in elevation generally evolved more 
recently than differences in feeding method and body size. In Fig. 2 we 
show this to be the case for the entire 358 species assemblage. Consis- 
tent with niche filling along the body size and feeding habit dimensions, 
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b_ Nonpasserines 


Oscines 
Figure 1 | Phylogeny and species distributions. a, Red shading indicates the 
area covered by the phylogeny and the rectangle denotes the study area. 

b, Estimate of phylogenetic relationships among all Himalayan songbirds (red), 
suboscines (a second suborder of the Passeriformes, green) and ten orders of 
nonpasserines (black), computed in BEAST” and dated using multiple fossils. 
95% confidence limits on dates based on the Bayesian sample are 


rates of diversification in size and shape have significantly slowed towards 
the present (Extended Data Figs 4 and 5) and clades dating back beyond 
ten million years (Myr) ago occupy restricted morphological space (for 
example, Fig. 2b). In contrast, differentiation along the elevation axis has 
occurred more recently, albeit still many millions of years ago (Fig. 2a, 
Extended Data Fig. 4). Along with these ecological patterns, lineage split- 
ting in the 358 species phylogeny declines precipitously through time 
(Fig. 2, Extended Data Table 1). Strikingly, the average time since sep- 
aration (+ the standard error, throughout) of pairs of close relatives 
(‘sister pairs’ on the assemblage phylogeny) is estimated as 7.1 Myr + 0.3, 
N = 116 pairs (97.5% lower bound based on a sample of trees from the 
a posteriori distribution, 5.6 Myr + 0.26). These results are consistent 
with the idea that niche availability limits the establishment of species 
in the assemblage, with new ecological opportunity” along the eleva- 
tion dimension appearing most recently, perhaps in association with 
documented climate change (and turnover in the mammalian fossil 
record) 6 to 10 Myr ago”. 
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Figure 2 | Morphological evolution. a, Number of lineages at a given time 
slice through the phylogeny (blue solid line) and disparity measures for three 
morphological traits—mass (solid black line) and two shape measures (dashed 
black lines)—and elevation (taken as the mid-point of range, dotted black line) 
for the east Himalayan songbirds. Disparity is a measure of the fraction of 
the total variance that is within the clades subtended by any timeline”. All three 
disparity measures deviate significantly from a Brownian motion model 
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approximately +3 Myr. c, Local abundance of songbirds (oscines) and 
suboscines + nonpasserines in the study area as assessed by overlapping 
published range maps with elevational distributions. Lines are nonparametric 
curve fits. Each point represents the total number of species thought to be 
breeding in a 0.04° X 0.04° square (about 2 km/). 


Although patterns of evolution suggest a dynamic whereby niche- 
filling slows species accumulation, a failure of species to become established 
in the assemblage may instead reflect insufficient time to develop repro- 
ductive isolation between allopatric forms. However, a strong slowdown 
in lineage diversification is present even if the phylogeny is truncated at 
3 Myr ago (Extended Data Table 1), which is a reasonable upper bound 
on the time it takes for reproductive isolation to be completed in nature”®; 
establishment of allopatric forms that are older than this should not 
be limited by reproductive isolation. Further, 85 Himalayan songbird 
species are found to the west of our study area. They are separated by 
an average of 7.5 + 0.38 Myr from their closest relative in the study area, 
and 77 of these 85 species breed alongside this close relative somewhere 
in their western range (overlap of at least 1,500 km). Thus, the failure 
of these 77 species to expand their range into the study area cannot be 
attributed to incomplete reproductive isolation. We directly evaluated 
the niche-filling explanation by studying species richness patterns along 
the elevational gradient. This is the ecological axis associated with the 
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(Extended Data Figs 4 and 5). b, First shape component versus mass. Polygons 
enclose morphologies for those six lineages at 11.5 Myr ago that subtend at least 
six species in the study area. One species from each clade is indicated (clockwise 
from top left Phylloscopus xanthoschistos, Parus monticolus, Acridotheres 
fuscus, Dicrurus paradiseus, Carpodacus rubicilla and Leucosticte brandti). The 
solid points are individual species with no shared relative dating to 11.5 Myr 
ago. Separate analyses for the major subclades are in Supplementary Table 1. 
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most recent divergences (Fig. 2) and hence a priori is the dimension 
most likely to contain additional opportunities for species to differen- 
tiate along. Songbirds peak in abundance at an altitude of about 1,800 m 
in the east Himalayas (Fig. 1). We ask why this should be. 

The most popular model that incorporates the slow development of 
reproductive isolation as a potential explanation for patterns of species 
richness along elevational gradients considers that each climatic zone 
independently accumulates species, and different zones accumulate spe- 
cies at different rates’. In this model, species and clades present in one 
zone remain confined to that zone because they are poorly adapted to 
alternative climates (climatic niche conservatism’’). In contrast to the 
predictions of this model, we found that the average age of separation 
of species in the assemblage declines monotonically with elevation, rather 
than being lowest in the most species-rich elevational band (Fig. 3a, 
Extended Data Fig. 6). Further, even at the high elevations, the plot of 
lineage diversity versus time shows an exceptionally strong slowdown, 
with close relatives separated by millions of years (Extended Data Fig. 6), 
suggesting that plenty of time has been available to accumulate addi- 
tional species in this zone. Finally, climatic niche conservatism along the 
elevational gradient is weak, with close relatives often found in different 
zones (see the elevation disparity curve in Fig. 2 and ref. 21). Therefore, 
there is little support in these data for climatic niche conservatism: newly 
formed species should be able to transition rapidly between climatic 
zones in the absence of any other limits. 

In models of niche filling, more species are found where resources 
are either more diverse or more abundant”. We tested the resource diver- 
sity hypothesis as an explanation for species distribution along the eleva- 
tional gradient by using morphological diversity as a surrogate”. We 
found that morphological diversity increases from the lowest to inter- 
mediate elevations, as may be expected given the increase in species 
numbers. However, morphological diversity increases still further to 
the highest elevations, despite the relatively few species at these eleva- 
tions (Fig. 3b, Extended Data Fig. 7). Exploitation of a relatively high 
diversity of resources at high elevations has been attributed to a release 
from competition from the nonpasserines’’, which predominate at lower 
elevations (Fig. 1c) and may also reflect the presence of additional open 
country habitats. Whatever the underlying causes, the monotonic increase 
in the apparent diversity of resources exploited does not match the mid- 
elevation peak in species richness. 

Finally, in a hypothesis of resource abundance a “poor environment 
supplies too meager a resource base for its would-be rarest species, and 
they become extinct” (page 56 of ref. 24). To investigate this, over six years, 
we established eighteen 5-hectare grids in forested habitats at all alti- 
tudes across the study area (Extended Data Figs 8 and 9). We censused 
breeding birds, classified them according to feeding habit, measured 
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Figure 3 | Phylogenetic and morphological diversity along the elevational 
gradient. a, Estimate of phylogenetic relatedness” for all songbirds in a given 
elevational band; the line is the least-squares regression. On the scale, a value 
of 1 would imply a star phylogeny (that is, all species have independent history), 
whereas a value close to 0 implies that many species are closely related to 
each other. b, Volume of the convex hull for three morphological dimensions 
(see Extended Data Fig. 5a), as occupied by all songbirds in a given elevational 
band (the line is a nonparametric line fit; the outlier Xiphirhynchus superciliaris 
is omitted from this plot; its inclusion greatly steepens the slope). 
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Figure 4 | Species richness and resource abundance. a, Number of breeding 
species censused on fifteen 5-hectare plots. b, Arthropod densities (from ref. 15, 
plus additional data (grey points)). Lines are nonparametric curve fits. 


vegetation parameters and, for a subset of grids, we also measured 
arthropod densities. The mid-elevation peak in these 5-hectare plots 
(Fig. 4a) is very similar to that found at the larger scale (Fig. 1c). We found 
that this peak is entirely attributable to insectivorous species (Fig. 4a) and 
strikingly matches arthropod abundance (Fig. 4b). Arthropod abund- 
ance correlates with the total number of insectivorous individual birds 
on the 5-hectare grid (from 44 to 218; Pearson’s correlation r = 0.76, 
the correlation with insectivorous bird biomass is r = 0.75) and the more 
insectivorous birds on the 5-hectare grid, the more insectivorous species 
were present (r = 0.93). A mid-elevation peak in arthropod abundance 
is surprising, given that both rainfall and temperature monotonically 
decline with elevation (Extended Data Fig. 8). However, mid-elevations 
are the lowest elevations that regularly freeze in winter and are there- 
fore likely to experience a large leaf and arthropod flush in the spring. 
In addition, the lowest elevations contain many other insectivorous ani- 
mals, potentially in competition with the songbirds. These include non- 
passerine and suboscine birds (Fig. 1), an insectivorous ant (Oecophylla 
smaragdina, which is abundant at low elevations but absent at mid- 
elevations, personal observation of D.M.H. in 2011) and probably other 
poikilotherms. 

In this continental environment, where species have had time to accu- 
mulate and dispersal barriers are relatively weak, both the appearance of 
new species and morphological diversification have substantially slowed 
and species distributions are well explained by abundance of resources. 
Rabosky and Matute* have demonstrated that, across clades, rates of 
evolution of reproductive isolation are generally decoupled from diver- 
sification rates. Our results imply that the ultimate limit on diversifica- 
tion is best explained, not by a slow rate of accumulation of reproductive 
isolation, but by the failure of species to expand ranges into new local- 
ities, which we attribute to competitive interactions. Because range expan- 
sions are essential to most ongoing speciation events’, local interactions 
ultimately determine regional speciation rates, rather than regional speci- 
ation rates setting local diversity, as in many non-equilibrium models”. 
These results contrast with those from young environments, such as 
young archipelagoes''—which provide some of the best evidence for 
ongoing adaptive radiations—and with those from dispersal-limited, 
extinction-prone environments, notably islands—which provide some 
of the best evidence for bird communities below saturation”. 


METHODS SUMMARY 


Tree construction. We obtained mitochondrial sequence data for all but one pas- 
serine species and nuclear data for 89%. We estimated a tree for the passerines and 
separately for 133 nonpasserines (all but 15 Himalayan species) from the following 
orders—the Piciformes, Bucerotiformes, Upupiformes, Trogoniformes, Coracii- 
formes, Cuculiformes, Psittaciformes, Apodiformes, Strigiformes and Columbiformes. 
For the primary analyses we constructed an ultrametric tree in BEAST”, dated 
using multiple fossils and biogeographic dates, all set with uniform priors. 

Morphology. We measured two or more specimens of each species in museums, 
drawing on males from within the study area whenever possible. We measured beak 
depth, beak length, beak width, tarsus length, and wing length. Mass and elevational 
distributions were based on the primary literature and our own fieldwork. In the 
main analyses we used In(mass) and the first two principal components extracted 
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from the correlation matrix of residuals of regressions of In(morphological traits) 
on In(mass), all standardized to have unit variance (alternative approaches give 
similar results; see Extended Data Fig. 5). 

Fieldwork. T.D.P. and D.M., plus up to two other experienced workers, indepen- 
dently censused birds in eighteen 5-hectare grids spanning the elevational gradient. 
In each grid two mornings were spent territory-mapping males, largely following the 
British Trust for Ornithology’s common bird census protocols (http://www.bto.org) 
and assuming that the number of birds was twice the number of censused males. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


Conceptual basis and null models. This paper builds an estimate of the phyloge- 
netic relationships of all members of a clade that are found in a locality, rather than 
the clade in total. Previous studies of adaptive radiations, defined here as diver- 
gence froma single ancestor to fill a variety of ecological niches™, have studied clades, 
not communities, but these do not directly allow an assessment of how ecological 
differentiation—the essence of adaptive radiation—controls diversification and 
speciation. 

The conceptual framework is illustrated in Extended Data Fig. 1, and follows 
refs 2, 3 and 35. Although adaptive radiation in young environments has been well 
studied, and connections between ecological diversification and reproductive iso- 
lation well established**, little work has been done in older environments, which 
are a priori more likely to be ecologically full. As shown in Extended Data Fig. 1, 
the predictions are that speciation slows when ecological niches are filled, and the 
reason for this is that range expansions into sympatry become increasingly diffi- 
cult. Range expansions may also slow because reproductive isolation is acquired 
more slowly when ecological opportunity is reduced (that is, different populations 
are now not under selection to adapt to different environments because niche 
space is increasingly filled up). However, even if populations are not subject to any 
divergent selection pressures at all, they should eventually differentiate to the level 
of full species. At that point, range expansions may remain difficult because of com- 
petition from the sister species (step 4 in Extended Data Fig. 1), as well as from other 
species in an environment where niches are increasingly occupied. 

Our conclusions are that east Himalayan birds have approached this last step. If 
sympatric species were about the same age as the attainment of reproductive isola- 
tion, incomplete reproductive isolation may be limiting species accumulation. How- 
ever, the average time to the common ancestor of pairs of close relatives (sisters on 
the assemblage tree) living in the sympatric assemblage is long (7 Myr). By con- 
trast, postmating reproductive isolation in nature may typically be completed by 
3 Myr since separation from their common ancestor (although complete loss of 
fertility in hybrids can evidently take longer than this*) because, first, many dele- 
terious effects in hybrids are apparent by that time, which will lower fitness in the 
wild, and, second, hybrids between species often fall between niches occupied by 
the parental forms*’®"’”*. This reasoning applies to postmating isolation and it is 
possible that premating isolation remains incomplete. If so, complete loss of hybrid 
fitness plus ongoing hybridization would be a powerful means of setting range 
limits*. However, hybrid zones between species separated by more than 3 Myr are 
rare”*. More specifically, in our extensive population studies of thousands of breed- 
ing pairs of songbirds in the western Himalayas”, we have never observed hybrid- 
ization events between species separated by more than 4 Myr. Further, we noted in 
the main text that 90% of the species absent from the study area do overlap with 
their closest relative in the study area somewhere in the Himalayas, implying that 
premating reproductive isolation is essentially complete between them. 

We bolster the argument of limits through niche-filling by building a time-dated 
phylogeny ofall species in the community, and find both a substantial slowdown in 
per-lineage rates of splitting within the phylogeny and a slowdown in morpho- 
logical evolution over time. Both of these results are consistent with niche avail- 
ability limiting diversification, but they need to be evaluated against a suitable null 
model: 

Lineage splitting. A slowdown in lineage splitting refers to a less than exponential 
increase in species, as expected from a pure birth (Yule) model (a model with con- 
stant speciation and extinction leads to a greater-than-exponential increase™*). A 
slowdown in lineage splitting has been observed in many studies of large clades” 
and has sometimes been interpreted as a consequence of niche-filling”’, as we do 
here. One bias is that there is a lag time to the production of allopatric forms that 
will then go on to be species, with the result that some of the recent splits in the phy- 
logeny are not included when they should be*”*"*. Previous work has addressed 
this difficulty by simply lopping off the most recent branch length (for example, 
refs 39 and 41). An advantage of working with sympatric assemblages is that spe- 
cies are defined unambiguously, so the lag time to species recognition is not an 
issue. However, given allopatric/parapatric speciation, there must bea delay to sym- 
patry that includes the following features: (1) development of reproductive isolation 
between allopatric forms, (2) dispersal across any barriers, and (3) establishment in 
the community. As can be seen from the plot of lineage diversification versus time in 
Fig. 2, and as quantified in Extended Data Table 1, a strong slowdown remains even 
if the phylogeny is truncated 3 Myr ago, which is surely longer than an appropriate 
null model for how long a lag time should be (it is longer than it has taken to produce 
the entire Darwin’s finch radiation’, which contains up to 11 sympatric species). 

Sampling. One set of null models considers that the Himalayan assemblage repre- 
sents a random sample from some larger region in which the Himalayas are embed- 
ded. A random sample of species from the tips results in a greater slowdown than is 
observed in the larger phylogeny, because deep nodes are generally captured, but shal- 
low nodes often will not be*’. Whether this is an appropriate null model is debatable 


(see below) but we test it in this section. We consider sampling from both all Asian 
oscines and the pure birth model, and find the downturn in the study area to be 
much stronger than expected on the basis of random sampling from these larger 
species pools, even if the plot of lineage versus time is truncated at 3 Myr ago (Ex- 
tended Data Table 1). In addition, trees sampled from a larger species pool (based 
on the tree of Jetz et al.*') have ages of sisters not greatly different from those in the 
larger pool (4.5 Myr compared to about 4 Myr in the analyses of Extended Data 
Table 1), which is substantially shorter than is observed in the sympatric assemblage 
(see the ‘Phylogeny construction’ section, in which the tree of ref. 31 is compared to 
the oscine tree illustrated in Fig. 1b). 

Given that neither a 3-Myr lag time to sympatry nor sampling can account for 

the strong slowdowns, we argue that the rejection of a constant diversification rate 
in the phylogeny of sympatric species results because more recently produced allo- 
patric species find it increasingly difficult to invade the community, plus possibly 
there has also been a slowdown in the global production of species, which, by 
induction, is itself ultimately driven by the failure of range expansions. Because 
over 1,500 songbird species are present in Asia but only 358 are present in the 
study area, a failure of establishment (range expansions) clearly accounts for some 
of the long lag times. 
Alternative null models. Despite the analyses reported in Extended Data Table 1, 
we suggest that random sampling from a larger phylogeny is not an appropriate 
null model if the goal is to assess ecological controls on species diversification. This 
is because most related allopatric forms reflect either ecological replacements, or 
responses to ecological diversity in other locations of the world and therefore are 
not part of an assemblage responding to local ecological controls. We illustrate the 
reasoning by considering one species present in the Himalayas, Zosterops palpeb- 
rosus, together with its closest sequenced relatives, as taken from the tree of ref. 31 
(Extended Data Fig. 2). 

Most of the members of the Z. palpebrosus clade (the clade containing the upper 
ten species in Extended Data Fig. 2) consist of allopatric forms, some of whom are 
quite young and may not be reproductively isolated, and all of which occupy an 
arboreal fruit-, flower- and insect-eating niche, and commonly forage in flocks, that 
is, are ecological equivalents in different locations. We suggest that young allos- 
pecies reflect species turnover within the same ecological niche“, with extinction 
attributable to both small population size and recent climatic change, followed by 
colonization and population fragmentation creating peripheral forms. These allo- 
patric forms would not be recognized as different species in the fossil record* and 
depend on expansion out of more stable refuges, such as the eastern Himalayas is 
inferred to have been’>”’. This then suggests that a local-to-regional focus (stage 5 
of Extended Data Fig. 1) as a limit on regional species numbers is at least as appro- 
priate as the common regional-to-local focus as a limit on local diversity (compare 
ref. 46). The regional-to-local focus forms the basis for considering the local fauna 
to be a sample from the greater region, as tested in a previous section. However, if 
local interactions prevent the build-up of species in sympatry, the regional fauna is 
ultimately limited by local interactions, rather than the other way round. 

Extended Data Fig. 2 also indicates some sympatry among Zosterops species. 

First, a pair of species on Reunion and Mauritius is thought to be the result of a 
double invasion’’ where one species (Z. olivacea) has become a flower specialist and 
is solitary**. Second, a clade of allopatric forms (the lower five species in Extended 
Data Fig. 2) contains some species sympatric with Zosterops palpebrosus. In sym- 
patry, these species differ ecologically from Z. palpebrosus: in China, Z. japonicus is 
migratory, and species in the Indonesian islands are altitudinally segregated from 
Z. palpebrosus. Thus, these examples of sympatry may be considered a response to 
ecological conditions in different regions. Assessment of ecological controls on diver- 
sification would need to include both the timing of colonization and ecological 
amplitude in these regions. 
Models of ecological controls. In young environments, a large diversity of niches 
may appear more or less simultaneously, with subsequent rapid diversification to 
fill these niches, as in classic examples such as cichlid fish and Darwin’s finches. In 
other examples, opportunity may arise in pulses, or uniformly (for example, as a 
result of extinctions). Price”? and Harvey and Rambaut” considered a simple model 
in which new niches arose uniformly through time (for example, as a result of an 
occasional extinction) and species evolve to occupy the new niche from the ecol- 
ogically closest ancestor. As shown in Extended Data Fig. 3, this model creates a 
strong downturn in the lineage splitting pattern, as well as a burst of ecological and 
morphological diversification early in the radiation”. 

The results of the ecological controls model illustrated in Extended Data Fig. 3 
are broadly consistent with the pattern we observe for lineage splitting and mor- 
phological evolution in the east Himalayan assemblage (Fig. 2). However, elevation 
does not show this pattern, with disparity accumulating relatively late. We suggest 
that the best interpretation is that a burst of ecological opportunity was relatively 
recently created along the elevation axis, in association with climate change 10 Myr 
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to 6 Myr ago” (the mammalian fossil record of Pakistan shows a high turnover 
across this period”’.) 

Although morphological disparity in the 358 east Himalayan species phylogeny 

rapidly accumulates (Fig. 2, Extended Data Fig. 4), it still does so more slowly than 
the disparity curve of the simple niche-filling model illustrated in Extended Data 
Fig. 3. One contributing reason appears to be that the relatively recent appearance 
of high-elevation habitat creates opportunities along the other (body size and shape) 
axes, which are not always taken up by colonization from the ecologically most sim- 
ilar relative (an assumption of the model). A striking example is the high-elevation 
ground tit (Pseudopodoces humilis), which is the rightmost point in the tit clade of 
Fig. 2 (Parus and Pseudopodoces, shown in green). Far larger than other tits, Pseu- 
dopodoces humilis is morphologically convergent on low-elevation mynas: both 
P. humilis and mynas are predominantly open-country ground foragers. We spec- 
ulate that climatic, habitat (absence of open areas at mid-elevations), and geographic 
barriers caused the ‘open’ niche at higher elevations to be not easily invaded by 
mynas, triggering the evolution of the ground tit from a tree-living Parus. This 
species is separated by an estimated 8 Myr from its closest relative in the study area, 
and the speciation occurred about the time high-elevation radiations were taking 
place in other groups 10-6 Myr ago (see Extended Data Fig. 6). 
Significance tests of morphological evolution. For the main analysis in the text, 
we used the natural logarithm of mass—In(mass)—and the first two principal com- 
ponents (PC1, PC2) from the residuals of In(each morphological trait) on In(mass), 
using the entire Himalayan oscine data set for which we have measurements, N = 441 
species. Correlations with the original variables are in Extended Data Fig. 5. We 
standardized each of these three variables—In(mass), PC1 and PC2—to have unit 
variance. The motivation for the standardization is that subtle differences in shape 
can have as much ecological significance as more obvious changes in size'*"* mak- 
ing it more meaningful to give each equal weighting in morphological dispersion 
assessments. However, similar results on morphological volumes accrue if the values 
are unstandardized, in which case size becomes dominant. 

The time course of morphological evolution is illustrated in Extended Data Fig. 4, 
replicating Fig. 2, but with diversification expected under the Brownian-motion 
model added. We used two methods to assess the significance of the slowdown in 
morphological evolution. First, we fitted two-rate Brownian-motion models for mor- 
phological traits to the east Himalayan oscine phylogeny using Geiger (Extended 
Data Fig. 5a, b). The two-rate model includes two additional parameters over the 
one-rate model: the breakpoint and the rate difference before and after the break- 
point. To account for uncertainty in tree topology in assessment of significance, 
we averaged likelihood scores over 100 trees sampled from the Bayesian posterior 
distribution of trees, and to assess significance we used the likelihood profile (Extended 
Data Fig. 5b). 

Ina second approach, we compared Ornstein—Uhlenbeck models to Brownian 
motion models (Extended Data Fig. 5c). This analysis was modified in several ways 
from the analysis in the previous paragraph. First, unlike the disparity plots (Fig. 2 
and Extended Data Fig. 4) and the assessment in Extended Data Fig. 5a, we extracted 
three phylogenetically corrected principal-component scores” from the correla- 
tion matrix of the log-transformed morphological measurements, for the eastern 
oscine species alone (N = 355 species for which we have measurements, out of 358 
species in total). The correlations (“revellePCs”) with the original variables are in 
Extended Data Fig. 5d. We then asked at what point in time morphological evolu- 
tion becomes constrained, by fitting Ornstein-Uhlenbeck and Brownian motion 
models to all clades subtended above a certain timeline, plus the one additional 
clade from the root to that timeline. In the Ornstein-Uhlenbeck models, all line- 
ages subtending more than one species at each timeline were modelled with sepa- 
rate optima, but the constraining parameter was assumed to be the same for each 
clade (compare ref. 51). In all models the dispersive parameter was set the same 
throughout the tree. Relative weights of the two models were computed from the 
corrected (second-order) Akaike Information Criterion (AICc) scores obtained in 
the R program OUCH™. For both the Ornstein-Uhlenbeck and Brownian-motion 
models, at every timeline those lineages subtending just a single present-day species 
were removed before the analysis. 

Both the tests indicate a strong slowdown in morphological evolution towards 
the present. This slowdown is not simply a result of a slowdown in lineage splitting, 
with morphological evolution concentrated at speciation events”. Over half of all 
speciation events occur after the slowdown in morphology has largely happened. 
These speciation events are often associated with elevation splits, ecological differ- 
ences that are accompanied by relatively little morphological differentiation. As 
noted above, the slowdowns in morphological evolution are consistent with early 
diversification to fill niches associated with the major modes of life, as expected if 
ecological controls form the ultimate limit on species diversity and speciation. 
Comparing morphology and elevation. The mid-point of elevation has evolved 
more recently than body size and body shape (Fig. 2, Extended Data Fig. 4). This 
is shown also by phylogenetic signal in the data, as measured using Blomberg’s 


LETTER 


K value™*”°. K is greater than 1 for size and shape (1.74 for size and 1.74 for shape, 
that is, close relatives are more similar than expected under the model of Brownian 
motion), and less than 1 for elevation (0.63, that is, there is more evolution in the 
terminal branches than expected under Brownian motion). Phylogenetic signal, as 
assessed by comparing the variance of phylogenetic contrasts in a randomization 
test®* is significant for size and shape (P = 0.02), but not for elevation (P = 0.07). 

To compare patterns of evolution for any pair of traits (for example, body size 

with elevation) we adapted the method of ref. 56. We computed the absolute values 
of the phylogenetic contrasts throughout the tree, and regressed them on node height: 
a positive value indicates more evolution at the tips. Regression slopes are negative 
for size and shape and positive for elevation. We then compared the slopes of the 
regressions with two-sample Student’s t-tests (with 351 degrees of freedom, as mor- 
phology was not available for four of the 358 species, implying 353 contrasts). The 
results were: size versus elevation, t = 3.9, two-tailed P < 0.001; size versus shape, 
t= 1.34, P= 0.18; and shape versus elevation, t = 2.8, P = 0.006. All traits were stan- 
dardized to unit variance before the analysis, to remove scale effects. 
Phylogeny construction. For taxonomic sampling, we incorporated all 461 oscines, 
all 5 suboscines and 133 of the 149 nonpasserines (from the Piciformes, Bucero- 
tiformes, Upupiformes, Trogoniformes, Coraciiformes, Cuculiformes, Psittaciformes, 
Apodiformes, Strigiformes and Columbiformes, excluding Galliformes, Anseriformes 
and Falconiformes) found in the Himalayas”'”’. The resulting data set includes over 
120 species sequenced for the first time, including rare species such as the Bugun 
liocichla (Liocichla bugunorum), known only from a few pairs. We also included 
37 non-Himalayan species for the purpose of time-calibration (see source data for 
Fig. 1). Taxonomy follows ref. 57, except as outlined in the source data file assoc- 
iated with Fig. 1. 

The majority of new sequences in this study were amplified using tissues sourced 
from museum collections or from feathers and blood collected in the field. How- 
ever, we had to rely exclusively upon material from toe-pads for 54 species (31 passer- 
ines and 23 nonpasserines) without available fresh material. Owing to the degraded 
nature of DNA in these samples, amplification of mitochondrial loci proved diffi- 
cult and nuclear data often impossible (see DNA extraction, amplification and sequenc- 
ing). To alleviate this limitation and maximize the amount of data used for phylogenetic 
analyses, in some cases we incorporated genetic data from non-Himalayan sur- 
rogate species. We incorporated sequence data from surrogate species, as listed in 
the source data for Fig. 1, according to three different rationales: (1) ifa species was 
the only member of its genus in the Himalayas we used an extra-limital congener; 
(2) if a species’ genus contained more than one species in the Himalayas but the 
non-Himalayan sister species of that particular species was clear and (3) ifa genus 
was not monotypic in the Himalayas and no locus-specific data for that genus was 
available for any of its Himalayan members, locus data from a non-Himalayan 
representative was assigned to a single Himalayan representative. 

For example, the genus Oriolus contains four species that inhabit the Himalayas, 
for none of which we had adequate material to amplify the RAGI locus and none of 
which have sequence data for the RAG1 locus on GenBank. Here, we added RAGI 
from the non-Himalayan Oriolus cruentus for the Himalayan Oriolus kundoo because 
we have no other RAGI data for this genus (rationale 3). Rationale 3 makes the 
assumption that the genus in question is monophyletic, which is most likely to be 
the case when further qualified by rationale 2. To assess whether the use of genetic 
data from surrogate species induced a bias in phylogenetic reconstruction, we 
repeated the phylogenetic analyses outlined below with all surrogate information 
removed. No bias was detected. 

DNA extraction, amplification, and sequencing. DNA was extracted using Qiagen 
DNEasy Blood and Tissue kits (Qiagen) according to the manufacturer’s instruc- 
tions. Museum tissue and toe-pad materials were digested overnight in a mixture of 
180 pl ATL buffer and 20 1 proteinase-K and then eluted with 200 pl or 50 plof AE 
buffer respectively, following the manufacturer’s instructions. We sequenced the mito- 
chondrial cytochrome b gene (cytb), the mitochondrial NADH dehydrogenase 
subunit 2 (ND2), intron 2 of the myoglobin gene, introns 6 and 7 of the ornithine decar- 
boxylase (ODC) gene, intron 11 of the glyceraldehyde-3-phosphodehydrogenase 
(GAPDH) gene, and the recombination activating protein 1 (RAG1) gene—partial 
coding sequences. The six loci were amplified and sequenced using standard primers 
and amplification profiles as described in ref. 58 for mitochondrial cytochrome b, 
by M. D. Sorenson et al. (http://people.bu.edu/msoren/primers.html) for mitochon- 
drial ND2, ref. 59 for myoglobin and GAPDH, and ref. 60 for RAGI. We designed 
new primers to amplify the ODC locus for this study. Toe-pad samples were am- 
plified as needed with a series of short, overlapping fragments of 200-600 base 
pairs, using a large set of genera-specific internal primers. All primers used in this 
study are available for use on request. Polymerase chain reaction (PCR) products 
were cleansed using ExoSap IT and sequenced directly on an ABI3730XL capillary 
sequencer. 

Phylogenetic analyses. Sequences for each locus were aligned using MAFFT v7.1 
(http://mafft.cbrc.jp/alignment/software/). Some manual adjustment was necessary 
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for nuclear sequences. Indels that were specific to a single species within the align- 
ment were removed to reduce noise. The best-fitting model for each locus was iden- 
tified with the AIC implemented in MrModelTest2 v2.3 (ref. 61), in conjunction 
with PAUP* (ref. 62), as follows: GTR+G for cytb and myoglobin; GTR+1+G for 
ND2, ODC, and RAGI; and HKY+G for GAPDH. 

A time-calibrated phylogeny of the Himalayan Passeriformes was estimated by 
Bayesian inference using BEAST v1.8 (ref. 28) with computational assistance from 
BEAGLE v2.0 (ref. 63). For computational reasons, we ran the passerines (including 
the suboscines) largely independently from the remaining nonpasserines. Some 
species were included in both runs to help with time-calibration. Each locus was 
assigned its own partition with unlinked substitution and clock models but with a 
linked tree model. Thirteen time calibrations were used to date the tree. All dates 
were modelled with a uniform prior. Biogeographic calibrations were modelled 
between the date of geographic activity and the present to place a maximum age 
boundary. Fossil calibrations were modelled between 80.0 million years ago and 
the date of the fossil to place a minimum age boundary. By using multiple cali- 
bration points scattered broadly throughout the tree, we expect the accuracy of age 
calibration to improve as the average distance between calibrated and uncalibrated 
nodes decreases™. We ran BEAST for 50 million generations sampling every 5,000 
generations for a total of 10,000 trees. We assessed run performance and deter- 
mined appropriate burn-in length using Tracer v1.5 (ref. 65) and constructed the 
maximum-likelihood clade credibility tree using TreeAnnotator v1.8 (ref. 28). Ten 
of the 13 time calibrations in the passerine part of the tree have been independently 
verified to be appropriate for use®’. We confirmed the additional three age calibra- 
tions using methods similar to those of ref. 66. 

For the main phylogeny illustrated in Fig. 1b, a time-calibrated nonpasserine 

tree was run using the same BEAST methods, as described above. We constrained 
the order-level topology according to ref. 67 both because of the difficulty of achiev- 
ing accurate order-level resolution with our limited data set and also because these 
deeper relationships are beyond the scope of this paper. We incorporated 11 time 
calibrations, three of which were also used for the oscine tree, using the same uni- 
form prior logic for fossil and biogeographic calibrations. We spliced together the 
Passeriformes and nonpasserine trees using APE®, based on the several passerine 
species included in the nonpasserine tree. Additional phylogeny appendices are 
available as follows: (1) passerine and nonpasserine trees submitted to http://treebase. 
org (http://purl.org/phylo/treebase/phylows/study/TB2:S15660); (2) the XML file 
used to create the passerine tree in BEAST (Supplementary Information). 
The Jetz et al.*' tree. The global tree of birds by ref. 31 includes all the species we 
studied here. However, among the 358 oscines in the eastern square, 85 species in 
their tree were without sequence data and were inserted into an assigned clade based 
on taxonomy, using the pure birth (Yule) model and nearly all the other species had 
less sequence data than we used. In our investigations we computed a maximum 
clade credibility tree on a sample of 100 trees (‘Hackett backbone’) downloaded 
from http://birdtree.org. The portion of the tree*! that includes the species we stu- 
died differs in several respects from ours and shows a much smaller slowdown in 
lineage-splitting events towards the present (the splitting-rate correlation coeffi- 
cient r = 0.13 lineages per lineage per Myr over the first third of the tree to r = 0.09 
to r = 0.085 over the last third; this compares with r = 0.27, r = 0.13 and r = 0.06 
in the presented tree). The difference appears to be for at least three reasons: (1) the 
much older root of the oscines in the tree of ref. 31 (53 Myr ago versus 34 Myr ago 
in the current one), (2) the tree of ref. 31 inserted 25% of the species using the pure 
birth model because no sequence data at all was available and (3) taxonomy is a 
poor guide to where these species should be inserted. 

The much older root in the tree of ref. 31 probably stems from the use of old 
fossils (in their analysis, young calibration dates could not be used, given the way 
clades were inserted into a backbone tree) and perhaps the use of log-normal priors, 
which gives undue weight to point estimates. The age for the root is incongruent with 
the fossil record, plus there is a relative lack of passerine morphological diversity”. 
The root of the Eurasian species in our tree dates precisely to the Oligocene/Eocene 
boundary (34 Myr ago), a period of extreme cooling with a drop of several degrees 
Celsius in the oceans over a short time”, and large turnover in the Asian mam- 
malian fossil record”’. It is therefore in accord with the idea that ecological oppor- 
tunity drove songbird diversification. 

Despite the conflicts between trees, a critical result on the age of ‘sisters’ (closest 
relatives on the phylogeny of the 358 sympatric oscine species) is consistent: the 
average age of sister pairs in the tree from ref. 31 (7 Myr + 0.47 Myr, N = 112 sisters, 
with 9 pairs separated by less than 2 Myr) is similar to that in the tree presented 
here (7.1 Myr + 0.3 Myr, N = 116, with 3 pairs separated by less than 2 Myr). The 
great age of Himalayan species appears to be robust. 

Estimating diversity along the elevational gradient. For Fig. 3a we used the 
phylogenetic species diversity measure of Helmus and Ives*”” because it captures 
phylogenetic structure in a way that is insensitive to phylogeny size. A value of 1 is 
equivalent to a star phylogeny, whereas a value close to zero indicates that many 


species are close relatives. We pruned the larger phylogeny to a smaller one that 
connects all species thought to be in a particular elevational belt, based on overlap- 
ping altitudinal ranges (see source data for Fig. 2). Results were similar if we used 
species present in our censuses of 5-hectare plots instead. 

In Extended Data Fig. 6 we show the plot of lineage versus time for the 500 m 
and 3,000 m elevation, both of which are estimated to contain the same number of 
species. The plot is cast on a linear scale to highlight the differences. The lower ele- 
vation has a more rapid accumulation of species from 20 Myr ago to 10 Myr ago, 
whereas the higher elevation has a more rapid accumulation from 10 Myr ago to 
6 Myr ago. Note that although the higher elevation consists of younger species, 
very few species accumulate at either location over the past 6 Myr. 

For Fig. 3b, as in the case of the phylogenetic diversity measure, we used all species 

thought to be in a particular elevational belt, based on overlapping altitudinal ranges. 
However, results were similar if we used species present in our censuses of 5-hectare 
plots instead (Extended Data Fig. 7). The volume of the convex hull was computed 
in morphological 3-space (Extended Data Fig. 5A, D) using the library FD”*”*. The 
convex hull is sensitive to sample size, hence is conservative with respect to the 
main finding (higher volume at higher altitudes, even though species numbers are 
less than at middle altitudes). A measure insensitive to sample size (summed vari- 
ance of the three dimensions) gave congruent results (not shown). In Extended Data 
Fig. 7 we show morphological space on the first two dimensions at four locations. At 
highest elevations, both the shape and size dimensions are greatest, which may 
partly reflect an absence of nonpasserine competitors'® as well as more open habitat, 
which favours a relatively long tarsus. 
Field methods. The ‘study area’ is bounded by the coordinates 88° E, 93.1° E, 26.3° N 
and 28.1° N, covers an area of 10,389 km?, and spans an elevational range from 37 m 
to 6,778 m. Essential climatic features are illustrated in Extended Data Fig. 8. Tem- 
perature, precipitation and estimated productivity decline monotonically with eleva- 
tion (Extended Data Fig. 8; ref. 75). Precipitation shows the greatest spatial variation 
at low elevations (Extended Data Fig. 8). 

T.D.P. and D.M. collected data over an elevational range of 150-4,100 m in six 

field seasons (2007-2012, mostly in the later years), preceded by three (T.D.P.) or 
more (D.M.) earlier visits to familiarize ourselves with the birds. We worked entirely 
in India, with no visits to Bhutan, and set up 5-hectare grids using the Global Posi- 
tioning System (GPS) in as undisturbed forest as possible, usually over 2 km from 
any substantial clearing. Locations of the grids are plotted in Extended Data Fig. 8, 
as listed in the Source Data for Fig. 4. Two observers walked slowly over a grid for 
two mornings during the breeding season, from dawn for about 4 h, mapping singing 
males and recording sightings. For the more common species, concordance among 
observers in the number of singing males was high. Although only one observer often 
recorded one or other of the uncommon species, both observers recorded similar 
numbers of species in total. We assume that a single observer’s record approximates 
breeding birds on the grid (that is, rare species which were seen by one or other of 
us are often at lower density than 1 pair per 5 hectares and some are unlikely to be 
breeding on the grid.) We analysed the surveys of T.D.P. and the surveys of D.M. 
plus assistants separately and got similar results. T.D.P.’s surveys are reported (see 
source data for Fig. 4 for the listing of surveys included). M.G.-H. collected arth- 
ropod data on or near nine of the grids, during three separate visits’”’’’. Methods 
usually involved placing fifty 200-litre garbage bags over lower branches, breaking 
the branch, anesthetizing the arthropods and then sorting through the bag (see ref. 
77 for standard errors.) Because not all grids were surveyed in this way, we used a 
quadratic curve fit to predict numbers of arthropods at a given elevation. The rel- 
atively few arthropods at low elevations were unexpected, so in May 2013, we made 
additional collections at a new site, plus some additional collections at site B1. See 
the source data for details on these collections. Vegetation data were collected on each 
grid”' and are also reported in the source data for Fig. 4. The smoothed lines in 
Fig. 4 were fitted using the lowess procedure in R (http://stat.ethz.ch/R-manual/ 
R-patched/library/stats/html/lowess.html). 
Mid-elevation peak in species richness. Species richness at the 2-km’ scale was 
estimated using the same raster as the http://worldclim.org 2.5-arcminutes reso- 
lution, which resulted in 5,246 squares across the study area. The points represent 
the total number of breeding species estimated for each square. To assign a species 
to a square, we derived its east and west range limits along the Himalayas from the 
maps of Rasmussen and Anderton” (as described in ref. 21), and its altitudinal 
limits from our own work (source data for Fig. 2). Smooth lines in Fig. 1 are General 
Additive Model (gam) fits using the mgcv library in R (http://cran.r-project.org/web/ 
packages/mgcv/index.html). Critical additional packages used to construct that 
figure were Sp’*””, Raster*’ and PBSmapping*!. 

Rahbek* drew attention to the presence ofa peak in bird species richness at mid- 
elevations, rather than a monotonic decline in species numbers with elevation. Ina 
survey of 78 bird studies McCain*’ found that about half showed a mid-elevation 
peak. Several ecological explanations have been presented. Rahbek* noted that mid- 
elevations in the neotropics have increased structural diversity, notably epiphytes, 
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McCain* found that species numbers often correlate with productivity (moun- 
tains that are dry at the base are those that tend to have mid-elevation peaks), 
Rosenzweig and Abramsky~ noted more generally that some mid-elevation peaks 
may arise even if productivity declines, owing to competition with other groups, and 
Terborgh™ showed that different guilds have different elevational patterns, which 
he related to different resource distributions. Our interpretation of the Himalayan 
gradient is consistent with available resources being the ultimate driver of the peak. 
The mid-elevation peak is restricted to insectivorous oscines, which is related to ar- 
thropod abundance: this in turn may result from both climatic factors (for example, 
a spring flush of leaf) and the presence of efficient competitors at lower elevations. 
However, several other hypotheses for the mid-elevation peak have been proposed®. 
First, larger areas may accumulate more species”. Within the study area (Extended 
Data Fig. 8), the area within 500-m bands (between elevations of 200 m and 3,700 m) 
is roughly similar, albeit with a slight mid-elevation peak (Extended Data Fig. 9a), 
but overall the low elevations (Indian plains) and the high elevations (Tibetan pla- 
teau) have much more area than the middle elevations. Three other hypotheses, not 
described in the main text, are considered in more detail. 

The first is a concern with data. If estimates of elevational ranges are simply 
based on connecting the highest and lowest records of a species (so-called range- 
through methods), then as one approaches the domain boundaries sampling error 
inevitably means that some species are missed**. Our 5-hectare plot surveys are 
unaffected by this issue and these local estimates match the integrated elevational 
distributions (compare Figs 1 and 4). Further, elevational ranges are unlikely to be 
underestimates of the true ranges because (1) they integrate over multiple years, 
publications and elevational gradients and (2) isolated breeding birds beyond their 
normal range limits may be especially recorded in the literature. Reason (2) means 
that it is conceivable that our ranges are over-estimated, but we have done our best 
to eliminate this on the basis of our field experiences. 

The second concern is that of a suitable null model against which to assess the 
mid-elevation peak. In one much-discussed model, whose null character has been 
debated*’**, shuffling ranges of fixed sizes on a bounded domain inevitably leads to 
more species at mid-elevations®. Some other null models give weak or no mid- 
elevation peaks**. In the original null model a prominent peak at mid-elevations is 
most expected if elevational ranges are a medium to large fraction of the total 
domain®’. In our study, oscine elevational ranges average 1,154 m, which is about a 
quarter of the distance from the plains to the treeline (just 32 of the 96 oscines at 
700 mare also found at 1,900 m and of the 155 at 1,900 m only 30 are also at 3,100 m; 
none of the species at 700 m are also at 3,100 m). The original mid-domain null 
model results in zero species at the boundaries”’. We simulated this version, sam- 
pling species range mid-points on the domain size most favourable for the null 
hypothesis (200 m, 3,800 m) and obtaining mean R’ associations of real and simu- 
lated data of 0.66, when richness was evaluated at every 100 m. Regions at the extremes 
lay above the 95% confidence limits from the simulated data and the whole transect 
between 1,000 m and 3,000 m lay below the null model confidence limits; a rela- 
tively poor fit. Because species distributions spread into south India and a long way 
above the treeline it is unclear what domain size to use: tests on larger domain sizes 
gave lower R? values. The mid-domain null has been generally rejected across bird 
studies®, including an application to the bird diversity gradient in the eastern Himalayas” 
and is clearly not compatible with the distributions of nonpasserines and those song- 
birds that are not insectivores, both of which show monotonic declines from low 
elevations (Figs 1 and 4). 

An important issue is whether the mid-elevation peak represents a “mass effect” 
whereby some species ina particular location are only present because of dispersal’, 
that is, they represent sink populations, which could not be maintained in the 
absence of immigration. In theory, species can disperse from both above and below 
at intermediate elevations but only from one direction closer to the domain edges. 
This may lead to a mid-elevation peak, with local plots saturated to similar levels 
but those at mid-elevations having excess sink species, that is, sink species” occur 
to a greater extent in the mid-elevations. To address this, we focused on the local 
(grid) surveys, which show a mid-elevation peak comparable to that at the larger 
scales (Fig. 4a). We listed as plausible candidates for sink species all species repre- 
sented by just a single breeding pair and which were under 300 m from either their 
upper or their lower elevational range limit (except in the case of grids below 300 m, 
when we used the criteria that these species should not extend south of the Himalayas 
into the plains or Assam hills). By this measure, up to 12 species at mid-elevations 
may be sink species. However, at elevations below 200 m up to 8 species may be 
sink species. If we remove sink species, the mid-elevation peak remains prominent 
(Extended Data Fig. 9b). 

The mid-elevation peak is attributed mostly to a large number of small species 
(<15 g). This pattern is consistent with a large number of small arthropods at these 
elevations”. At lower elevations, arthropod distributions are skewed towards larger 
sizes, and insectivores are also generally larger. Our hypothesis, following ref. 24, 
is that a large abundance of arthropods permits finer subdivision of the niche. 


LETTER 


Terborgh and Faaborg” noted that insectivores may more generally be able to 
achieve fine niche partitioning, and this may well not be reflected in large mor- 
phological differences. 


31. Jetz, W., Thomas, G. H., Joy, J. B., Hartmann, K. & Mooers, A. O. The global diversity 
of birds in space and time. Nature 491, 444-448 (2012). 

32. Price, T. Correlated evolution and independent contrasts. Phil. Trans. R. Soc. Lond. B 
352, 519-529 (1997). 

33. Hijmans, R. J., Cameron, S. E., Parra, J. L., Jones, P. G. & Jarvis, A. Very high 
resolution interpolated climate surfaces for global land areas. Int. J. Climatol. 25, 
1965-1978 (2005). 

34. Losos, J. B. Adaptive radiation, ecological opportunity, and evolutionary 
determinism. Am. Nat. 175, 623-639 (2010). 

35. Mayr, E. Systematics and the Origin of Species from the Viewpoint of a Zoologist 
(Columbia Univ. Press, 1942). 

36. Nosil, P. Ecological Speciation (Oxford Univ. Press, 2012). 

37. Price, T., Zee, J., Jamdar, K. & Jamdar, N. Bird species diversity along the 
Himalayas: a comparison of Himachal Pradesh with Kashmir. J. Bombay Nat. Hist 
Soc. 100, 394-409 (2003). 

38. Harvey, P. H., May, R. M. & Nee, S. Phylogenies without fossils. Evolution 48, 
523-529 (1994). 

39. Phillimore, A. B. & Price, T. D. Density-dependent cladogenesis in birds. PLoS Biol. 
6, 483-489 (2008). 

40. Phillimore, A. B. & Price, T. D. in Speciation and Patterns of Diversity (eds Butlin, R. K., 

Bridle, J. & Schluter, D.) 240-256 (Cambridge Univ. Press, 2009). 

41. Weir, J. T. Divergent patterns of species accumulation in lowland and highland 

neotropical birds. Evolution 60, 842-855 (2006). 

42. Etienne, R.S. & Rosindell, J. Prolonging the past counteracts the pull of the present: 

protracted speciation can explain observed slowdowns in diversification. Syst. Biol. 

61, 204-213 (2012). 

43. Pybus, O. G. & Harvey, P. H. Testing macro-evolutionary models using incomplete 

molecular phylogenies. Proc. R. Soc. Lond. B 267, 2267-2272 (2000). 

44. McPeek, M.A. The ecological dynamics of clade diversification and community 

assembly. Am. Nat. 172, E270-E284 (2008). 

45. Rabosky, D. L. Diversity-dependence, ecological speciation, and the role of 

competition in macroevolution. Annu. Rev. Ecol. Evol. Syst. 44, 481-502 (2013). 

46. Ricklefs, R. E. Community diversity: relative roles of local and regional processes. 

Science 235, 167-171 (1987). 

47. Coyne, J. A. & Price, T. D. Little evidence for sympatric speciation in island birds. 

Evolution 54, 2166-2171 (2000). 

48. Gill, F. B. Ecology and evolution of sympatric Mascarene white-eyes, Zosterops 

borbonica and Zosterops olivacea. Auk 88, 35-60 (1971). 

49. Badgley, C. et al. Ecological changes in Miocene mammalian record show impact 
of prolonged climatic forcing. Proc. Natl Acad. Sci. USA 105, 12145-12149 (2008). 

50. Revell, L. J. Size-correction and principal components for interspecific 
comparative studies. Evolution 63, 3258-3268 (2009). 

51. Beaulieu, J. M., Jnwueng, D. C., Boettiger, C. & O'Meara, B. C. Modeling stabilizing 
selection: expanding the Ornstein-Uhlenbeck model of adaptive evolution. 
Evolution 66, 2369-2383 (2012). 

52. Butler, M.A. & King, A. A. Phylogenetic comparative analysis: a modeling approach 
for adaptive evolution. Am. Nat. 164, 683-695 (2004). 

53. Ricklefs, R. E. Cladogenesis and morphological diversification in passerine birds. 
Nature 430, 338-341 (2004). 

54. Blomberg, S. P., Garland, T. & Ives, A. R. Testing for phylogenetic signal in 
comparative data: behavioral traits are more labile. Evolution 57, 717-745 (2003). 

55. Kembel, S. W. et a/. Picante: R tools for integrating phylogenies and ecology. 
Bioinformatics 26, 1463-1464 (2010). 

56. Mahler, D.L., Revell, L.J., Glor, R. E.& Losos, J. B. Ecological opportunity and the rate 
of morphological evolution in the diversification of Greater Antillean Anoles. 
Evolution 64, 2731-2745 (2010). 

57. Rasmussen, P.C. &Anderton, J.C. Birds of South Asia: the Ripley Guide (Lynx, 2005). 

58. Weir,J.T.& Schluter, D. The latitudinal gradient in recent speciation and extinction 
rates of birds and mammals. Science 315, 1574-1576 (2007). 

59. Dong, F.,Li,S.H.& Yang, X.J. Molecular systematics and diversification of the Asian 
scimitar babblers (Timaliidae, Aves) based on mitochondrial and nuclear DNA 
sequences. Mol. Phylogenet. Evol. 57, 1268-1275 (2010). 

60. Groth, J. G. & Barrowclough, G. F. Basal divergences in birds and the phylogenetic 
utility of the nuclear RAG-1 gene. Mol. Phylogenet. Evol. 12, 115-123 (1999). 

61. Nylander, J. A.A, Ronquist, F., Huelsenbeck, J. P. & Nieves-Aldrey, J. L. Bayesian 
phylogenetic analysis of combined data. Syst Biol. 53, 47-67 (2004). 

62. Swofford, D. L. PAUP*. Phylogenetic Analysis Using Parsimony (*and other methods) 
Version 4 (Sinauer, 2003). 

63. Ayres, D. L. et a/. BEAGLE: an application programming interface and high- 
performance computing library for statistical phylogenetics. Syst Biol. 61, 
170-173 (2012). 

64. Linder, H. P., Hardy, C. R. & Rutschmann, F. Taxon sampling effects in molecular 
clock dating: an example from the African Restionaceae. Mol. Phylogenet. Evol. 35, 
569-582 (2005). 

65. Rambaut, A. & Drummond, A. J. Tracer v1.4 http://beast.bio.ed.ac.uk/ (2007). 

66. Weir, J.T. & Schluter, D. Calibrating the avian molecular clock. Mol. Ecol. 17, 
2321-2328 (2008). 

67. Hackett, S.J. etal. A phylogenomic study of birds reveals their evolutionary history. 
Science 320, 1763-1768 (2008). 

68. Paradis, E., Claude, J. & Strimmer, K. APE: analyses of phylogenetics and evolution 
in R language. Bioinformatics 20, 289-290 (2004). 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


69. 


70. 
71. 
72. 
73. 
74. 


75. 


76. 
77. 


78. 
79. 
80. 
81. 


Mayr, G. The age of the crown group of passerine birds and its evolutionary 
significance—molecular calibrations versus the fossil record. Syst. Biodivers. 11, 
7-13 (2013). 

Hansen, J. et al. Target atmospheric CO»: where should humanity aim? Open 
Atmos. Sci, J. 2, 217-231 (2008). 

Kraatz, B. P. & Geisler, J. H. Eocene-Oligocene transition in Central Asia and its 
effects on mammalian evolution. Geology 38, 111-114 (2010). 

Helmus, M. R. & Ives, A. R. Phylogenetic diversity-area curves. Ecology 93, S31-S43 
(2012). 

Laliberté, E. & Legendre, P. A distance-based framework for measuring functional 
diversity from multiple traits. Ecology 91, 299-305 (2010). 

Laliberté, E. & Shipley, B. FD: measuring functional diversity from multiple traits, 
and other tools for functional ecology. R package version 1. 0-11, http://cran. 
r-project.org/web/packages/FD/index.html (2011). 

Acharya, B. K., Sanders, N. J., Vijayan, L. & Chettri, B. Elevational gradients in bird 
diversity in the eastern Himalaya: an evaluation of distribution patterns and their 
underlying mechanisms. PLoS ONE 6, e29097 (2011). 

Gross, S. & Price, T. Determinants of the northern and southern range limits of a 
warbler. J. Biogeogr. 27, 869-878 (2000). 

Ghosh-Harihar, M. Distribution and abundance of foliage-arthropods across 
elevational gradients in the east and west Himalayas. Ecol. Res. 28, 125-130 
(2013). 

Pebesma, E. J. & Bivand, R. S. Classes and methods for spatial data in R. R News 5, 
http://cran.r-project.org/doc/Rnews/ (2005). 

Bivand, R. S., Pebesma, E. J. & Gomez-Rubio, V. Applied Spatial Data Analysis with R. 
(Springer, 2008). 

Hijmans, R. J. & van Etten, J. Raster: geographic analysis and modeling with raster 
data. R package version 1. 9-92, http://CRAN.R-project.org (2011). 

Schnute, J. T., Boers, N. M. & Haigh, R. PBS Mapping 2: user’s guide. Can. Tech. Rep. 
Fish. Aquat. Sci. 2549, 126 (2004). 


82. 


92. 


93. 


Rahbek, C. The elevational gradient of species richness—a uniform pattern. 
Ecography 18, 200-205 (1995). 


. McCain, C. M. Global analysis of bird elevational diversity. Glob. Ecol. Biogeogr. 18, 


346-360 (2009). 


. Terborgh, J. Bird species diversity on an Andean elevational gradient. Ecology 58, 


1007-1019 (1977). 


. Sanders, N. J. & Rahbek, C. The patterns and causes of elevational diversity 


gradients. Ecography 35, 1-3 (2012). 


. Grytnes, J. A. & Vetaas, O. R. Species richness and altitude: a comparison between 


null models and interpolated plant species richness along the Himalayan 
altitudinal gradient, Nepal. Am. Nat. 159, 294-304 (2002). 


. Hawkins, B. A., Diniz, J. A. F. & Weis, A. E. The mid-domain effect and 


diversity gradients: is there anything to learn? Am. Nat. 166, E140-E143 
(2005). 


. Zapata, F.A., Gaston, K. J. & Chown, S. L. The mid-domain effect revisited. Am. Nat. 


166, E144-E£148 (2005). 


. Colwell, R. K., Rahbek, C. & Gotelli, N.J. The mid-domain effect: there’s a baby in the 


bathwater. Am. Nat. 166, £149-E154 (2005). 


. McCain, C. M. The mid-domain effect applied to elevational gradients: 


species richness of small mammals in Costa Rica. J. Biogeogr. 31, 19-31 
(2004). 


. Grytnes, J.A, Heegaard, E. & Romdal, T. S. Can the mass effect explain the 


mid-altitudinal peak in vascular plant species richness? Basic Appl. Ecol. 9, 
373-382 (2008). 

White, E. P. & Hurlbert, A. H. The combined influence of the local environment 
and regional enrichment on bird species richness. Am. Nat. 175, E35-E43 
(2010). 

Terborgh, J. W. & Faaborg, J. Saturation of bird communities in the West Indies. Am. 
Nat. 116, 178-195 (1980). 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


Time Space 


Extended Data Figure 1 | The speciation cycle. A species distributed across 
space becomes fragmented as a result of either vicariance (illustrated) or 
dispersal. After barrier formation, reproductive isolation develops. For the cycle 
to continue at least one of the species must expand into the other’s range, which 
requires reproductive isolation, and generally also ecological compatibility. 
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Extended Data Figure 2 | Close relatives of a single Himalayan species, 
the Oriental white-eye, Zosterops palpebrosus. Only species for which 
sequence data are available are included (tree from ref. 31). The range of 
Z. palpebrosus (light red) overlaps with members of the clade containing 
the lower 5 species, for example, with Z. japonicus (which is migratory) 


Zosterops montanus @ 


Zosterops japonicus 


1 Myr 


(light green) in eastern China and Z. atricapilla (red) and Z. montanus (blue) in 
Indonesia, where Z. palpebrosus is altitudinally segregated from them. Within 
the Z. palpebrosus clade, all species are allopatric replacements, except for 

the two species on the Mascarene Islands (bracketed). The timeline is from 
ref. 31. 
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Extended Data Figure 3 | Plot of lineage versus time and morphological 
disparity generated in a simple model of ecological controls”. In this model, 
new niches appear uniformly through time, and new species arise to fill them, 
with the criterion that new species are always derived from the ecologically 
most similar form**. For this simulation, the position of a new niche was 
drawn from a bivariate normal (x, y) with a correlation of 0.5, with 380 niches 
appearing sequentially and uniformly spaced in time. The result is a linear 
accumulation of species through time (that is, a downturn on the log scale), 
and most of the morphological variation accumulating early in the radiation 
(in the plot, disparity for one variable is shown). 
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Extended Data Figure 4 | Disparity plots for morphology and habitat, with _ posterior distribution of the Bayesian analysis. The shaded areas around the 
the null Brownian-motion model added. The large shaded area represents the data plots gives the 95% confidence limits based on phylogenetic uncertainty 
95% confidence limit from 100 simulations on 100 trees drawn from the (based on the same 100 trees as above). 
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Extended Data Figure 5 | Slowdowns in morphological evolution across the 
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tree of the east Himalayan oscines. a, Maximum likelihood breakpoints 
(the point in time at which one rate becomes favoured over the other) and 
changes in rate for two-rate models of morphological evolution. Significance 
values (*P < 0.01, **P < 0.001) refer to likelihood ratio tests comparing the 
one- and two-rate Brownian motion models (PC2, P = 0.16). 95% support 
limits (parentheses) were derived from likelihood profiles averaged across 
100 trees sampled from the posterior distribution of Bayesian trees. b, The 


likelihood profile for evolution of the first shape index (PC1). The likelihood for 
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Time (Myr from root) 


Measurement PCI PC2___revellPCl _revellPC2_ _revellPC3 
Beak length -0.2 0.54 0.89 0.16 -0.15 
Beak width -0.59 -0.08 0.92 -0.53 -0.19 
Beak depth -0.51 -0.04 0.95 -0.49 -0.09 
Wing length -0.09 -0.03 0.89 -0.16 0.23 
Tarsus length 0.27 0.23 0.71 0.18 0.57 


each (x, y) combination was obtained as the average across 100 trees, then log- 
transformed for the figure. Numbers are the difference in log-likelihood from 
the maximum (100). Only values less than 2 units are shown. The profiles are 
indicated by symbols (squares for the breakpoint, and circles for the rate 
difference). c, Relative weights of Ornstein-Uhlenbeck (OU) and Brownian- 
motion models of morphological evolution at different timelines, based on 
phylogenetically corrected principal components (revellePCs) (see text). 

d, Correlations of PC scores with the original (log-transformed) variables. 
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Extended Data Figure 6 | Plot of lineage diversity (on a linear scale) versus 


time for a phylogeny connecting all species present at 500 m and at 3,000 m. 
Eighty-two species are estimated to straddle each of these elevational bands. 
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Extended Data Figure 7 | Morphology at specified elevations. Grey linesare  B2, A3, B1 and G1), and green lines are the convex hulls for all common 

the convex hull for all species in the study area (points as in Fig. 2). Black lines | (>5 pairs per hectare) species on those grids. Number of species is the number 
are the convex hulls for all species whose elevational ranges include the of all bird species in that elevational belt, plus (in parentheses) the number of 
specified band. Blue lines are the convex hulls for all the species censused on _ songbirds censused on the grid. 

5-hectare grids at those elevations (see the source data, in order of elevation, 
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Extended Data Figure 8 | Climate data (from http://worldclim.org33). The _ predicted climate variables (minimum and maximum temperatures, and 


top panel shows precipitation mapped on to a topographical map of the study __ precipitation) for the 18 grids. Lines are least-squares regression slopes. 
area, showing the locations of the 18 grids. The bottom panels show three 
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Extended Data Figure 9 | Mid-elevation peak plots. The left panel shows area _oscines in the censused 5-hectare grids and number of oscines in those grids 
in 500-m bands between 200 m and 3,700 m in the study area (computed using _ discounted by possible sink species (rare species at the edge of their range). 
http://worldclim.org altitude data). The right panel shows the number of 
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Extended Data Table 1 | Significance in the downturn in the plot of lineage diversification versus time 


Sample space One tailed P (y) 

*1 504 Asian oscines P<0.001 (—2.80) 
* 1,662 Asian oscines P<0.001 (2.61) 
871 tips pure birth model P <0.001 (—2.95) 
1,546 tips pure birth model P<0.001 (-3.12) 
7,686 tips pure birth model P<0.001 (-3.49) 


The significance was assessed by the statistic, where a value of less than — 1.96 is considered a significant slowdown, when compared to the pure birth (Yule) model®?“4. The Pvalue was assessed by drawing 1,000 
random samples of 358 species from the indicated larger phylogenies; the average 7 value from the simulations is in parentheses. The asterisk indicates oscines considered to be Asian (from http://avibase.bsc- 
eoc.org), with relationships described by the tree of ref. 31, including those that Jetz et a/.*1 inserted without sequence data. The top row is based on the checklist of Clements 2013 and the second row on Clements 
2005, as given in the avibase database. Species with different names in the tree of ref. 31 and the Clements compilations were not included (N = 178 for 2005, N = 467 for 2013). The lower three rows are examples 
of trees simulated under the Yule model. In all cases the observed y statistic for the east Himalayan assemblage lay far outside the range of simulated values: The y statistic for the phylogeny connecting the 358 
oscines in the study area based on the tree used here (Fig. 1b) is -13.94 and from the tree of ref. 31 is —8.30 (357 oscines because two Corvus were not split in the tree of ref. 31). The y statistic for a phylogeny 
truncated at 3 Myr ago is — 14.72 (current tree in Fig. 1b, 346 tips), —10.11 (the tree of ref. 31, 330 tips). The time separating sister pairs in the 1,504 Asian species extracted from the Jetz et al. tree** is 4.0 Myr and 
the average time of separation of sisters in 358 species samples from that tree 4.5 Myr. For further comparisons with the tree of ref. 31 see the phylogeny construction section in the Methods. 
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Sensory stimulation shifts visual cortex from 
synchronous to asynchronous states 


Andrew Y. Y. Tan?*, Yuzhi Chen!”?*, Benjamin Scholl':**, Eyal Seidemann'? & Nicholas J. Priebe!’* 


In the mammalian cerebral cortex, neural responses are highly var- 
iable during spontaneous activity and sensory stimulation. To explain 
this variability, the cortex of alert animals has been proposed to be 
in an asynchronous high-conductance state in which irregular spik- 
ing arises from the convergence of large numbers of uncorrelated 
excitatory and inhibitory inputs onto individual neurons’ ~*. Sig- 
natures of this state are that a neuron’s membrane potential (V,,) 
hovers just below spike threshold, and its aggregate synaptic input 
is nearly Gaussian, arising from many uncorrelated inputs’ *. Alter- 
natively, irregular spiking could arise from infrequent correlated 
input events that elicit large fluctuations in V,, (refs 5, 6). To distin- 
guish between these hypotheses, we developed a technique to per- 
form whole-cell V,,, measurements from the cortex of behaving 
monkeys, focusing on primary visual cortex (V1) of monkeys per- 
forming a visual fixation task. Here we show that, contrary to the 
predictions of an asynchronous state, mean V,,, during fixation was 
far from threshold (14 mV) and spiking was triggered by occasional 
large spontaneous fluctuations. Distributions of V,, values were 
skewed beyond that expected for a range of Gaussian input®’, but 
were consistent with synaptic input arising from infrequent corre- 
lated events”*®. Furthermore, spontaneous fluctuations in V,,, were 
correlated with the surrounding network activity, as reflected in simul- 
taneously recorded nearby local field potential. Visual stimulation, 
however, led to responses more consistent with an asynchronous 
state: mean V,, approached threshold, fluctuations became more 
Gaussian, and correlations between single neurons and the surround- 
ing network were disrupted. These observations show that sensory 
drive can shift a common cortical circuitry from a synchronous to 
an asynchronous state. 

Cortical neurons show variable activity even after efforts are taken to 
fix temporal variations in sensory stimuli and attentional state*. This 
ongoing activity affects stimulus encoding and synaptic plasticity’, but 
its neural basis is not well understood. One hypothesis is that the var- 
iable activity in alert animals arises from connections between numerous 
uncorrelated excitatory and inhibitory inputs’ *. Such a network is con- 
sistent with studies of neural architecture’, and shows spiking statistics 
similar to those measured in extracellular studies*. Predictions of this 
hypothesis” *°” are that numerous uncorrelated inputs (Fig. 1a, bottom) 
cause V,,, to hover near spike threshold (Fig. 1a, top left) and to show 
distributions that are near Gaussian or skewed with tails at hyperpo- 
larized potentials (Fig. 1a, top right). In contrast, neurons may receive 
correlated input** (Fig. 1b, bottom) such that V, lies far below thresh- 
old and shows infrequent large excursions (Fig. 1b, top left), forming 
skewed distributions with tails at depolarized potentials (Fig. 1b, top 
right). Measurements of V,, from awake, non-behaving cats suggest 
an asynchronous state"’, but are also consistent with correlated input’. 
Data from behaving rodents in various attentional states have suggested 
different pictures'*"'®, but equivocally, because of the potential contribu- 
tions of uncontrolled sensory inputs and attentional states to V,, dynamics. 
Extracellular recordings in drowsy humans have demonstrated correlated 


spontaneous cortical activity, leaving open the possibility that correla- 
tions are absent during alertness'’. Accordingly, we performed the first 
whole-cell V,,, measurements from the cortex of monkeys actively engaged 
in a visual fixation task, allowing us to examine V,, in single V1 neurons 
of alert primates while minimizing variability due to sensory stimuli, 
eye movements and attentional state. 

We obtained intracellular’’, whole-cell’?”, current-clamp measure- 
ments of V,, from 31 V1 neurons in three macaque monkeys while they 
viewed gratings of different orientations (see Supplementary Informa- 
tion and Supplementary Video). Each trial began when a fixation spot 
was displayed at the centre of a monitor in front of the monkey. The 
monkey had to shift gaze to the fixation point and maintain tight fixa- 
tion for at least 1,500 ms to receive a reward. A drifting sinusoidal grat- 
ing was presented for 1,000 ms while the monkey was maintaining 
strict fixation. We analysed V,, during the fixation period only from 
trials in which the monkey performed the task successfully. V1 neurons 
were orientation-selective, and were classified as simple or complex 
(Supplementary Information and Extended Data Fig. 1). 

Comparison of V,,, in blank trials in which no visual stimulus was 
presented (Fig. 2a—c, left) with suprathreshold responses evoked by pre- 
ferred orientation gratings (Fig. 2a—c, right) shows that blank trial V,, 
was generally far from spike threshold. There were occasional large 
depolarizations during blank trials, which were manifested in the posi- 
tive skewness of V,,, amplitude histograms: these had longer tails at depo- 
larized potentials, even though traces had had spikes removed (Fig. 2a-c, 
left, orange histograms; see also Supplementary Information and Extended 
Data Fig. 2). Across neurons, the median distance between blank trial 
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Figure 1 | V,, characteristics depend on network state. a, Diagram of an 
asynchronous high-conductance state. A neuron receives numerous 
uncorrelated inputs (bottom), V,,, hovers near spike threshold (top left), 
forming distributions with low or negative skewness ¢ (top right). b, A neuron 
may instead receive correlated inputs (bottom) such that V,, lies farther from 
spike threshold and shows occasional large fluctuations (top left), forming 
distributions with high skewness ¢ (top right). 
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line (left). Horizontal scales for the histograms are logarithmic. Right: traces 
from three preferred orientation trials; the arrow indicates stimulus onset. 
Lower and upper dashed lines indicate the period of required fixation and the 
spike threshold, respectively. Results in a, b and ¢ are from different neurons. 
d, Distribution across neurons of distance between mean V,, during blank 
trials and spike threshold (n = 26). e, The population distribution of V,,, for 
blank trials is the average of each neuron’s normalized mean-subtracted 
distribution. f, Distribution across neurons of blank trial Vi, skewness. Light 
and dark bars in d and f indicate simple and complex cells, respectively. 


Vim and spike threshold was 13.9 mV (Fig. 2d). The median skewness of 
0.72 (Fig. 2e, f) differs from the near zero or negative skewness expected 
for a range of Gaussian input (Fig. 1a; see also Supplementary Infor- 
mation and Extended Data Fig. 2c), but is consistent with synaptic 
input arising from infrequent correlated events (Fig. 1b). These data show 
that in the absence of visual stimulation, V1 of macaques performing a 
visual fixation task is not in an asynchronous high-conductance state'*. 

By comparison, visual stimulation depolarized neurons (Fig. 2a-c, 
right, and Fig. 3a—c) and decreased the skewness of V,, deviations from 
the mean (Fig. 3a—c; see also Supplementary Information and Extended 
Data Fig. 3), an effect that was significant across the population (Fig. 3d; 
Wilcoxon signed-rank test, P < 0.0001; see also Supplementary Infor- 
mation and Extended Data Fig. 4). Together with observed increases 
in membrane conductance during visual stimulation**”* (Supplemen- 
tary Information and Extended Data Fig. 5), these results suggest that 


LETTER 


Fixation 


Stimulation 


1 
a raw V,, 

10-1 

10° 


Residuals 


bo 
1 
feed 
6 
=) 
L 
& 
So 


Membrane 
potential (mV) 
& 

3 
1 

o 


Proportion 


Power spectrum (mV? Hz”) 


40°) 4 


Eye position 
I 

| 

1 

; I 
I 

H 

I 

| 

1 

° 

io 

ws 

W 

1 

é 

Q 

oO 
S46 
baa 


wt i Ls 

— 0) 

De 2 0 20 1° "Fo Too 
Membrane potential Frequency (Hz) 


sv 


0.25. 


Membrane 
potential (mV) 
oh 
a 
1 1 1 
Proportion 
sd 
° Ro 
& 
_— 
P 
[oy 
i) ne 
u 
np 
a8 0 
f) 
Power spectrum (mV? Hz*) 
Pea 264 GSS 
»> SR b&SRB LSmNB 


c =—— = =—— —— -80 -60 ~40 
8 0.25 ¢=0.30 
3 49°} ra h 
a meas ~ ml) L 
8 a — luli, 
2s -20 ie) 20 1 10 100 
Membrane potential Frequency (Hz) 
-305 0.2 102 
t=0.31 10° 
= 102 
. | : oy, 
2Qe€ 0 ry = 
fs -75 -6€0 -45 2 
BE 54 § 02 E10 
22 + — 10° 
206 5 2 
TNL TW o 
. z 75 -60 -45 a 
-80 4 —— — —2 0. pee S 10° 
& ae oe ad 
= - > 10-2 
8 40) ¥—_ oe + 0 ls 
ES -10 0 10 1 10 100 
bas 2s Membrane potential Frequency (Hz) 
d e 
3 - 10 , 10° - 06 1 
re « & ’ 1 
’ 3 “8 , 
24 “ 5 io! “oe 8 oe 5 
“ B10 & & © Oe = 
1-4 e.42 © g ° %& 0 8 10 a 6 5 03 
of 7 @o £10) bse z *: g 
’ = “60 s , 
17 3% a a i) Pe 
1+ {——1—_~©s*10" T T 1 10° T 1 0 
=1 o 1 2 3 107 10° 10' 10? 10+ 10? 10° -10 (0) 10 20 
Solank Blank power Membrane potential (mV) 


Figure 3 | Visually evoked V,, is closer to threshold and has more Gaussian 
fluctuations. a-c, Left: V,, (top) and eye position (bottom) over pre-stimulus, 
stimulus and post-stimulus periods during fixation (green, lavender and 

grey bars, respectively) and inter-trial periods. Right: pre-stimulus, stimulus 
period raw value and residual (green, lavender-filled and lavender-outlined 
respectively) histograms and power spectra; shaded areas indicate low and high 
frequency ranges. Results in a, b and c are from different neurons. d, Skewness 
of V,, residuals during the preferred orientation versus that during blank 
trials, for each neuron. e, Mean power in V,, residuals during blank versus 
visual stimulation at low frequencies (0.5-4 Hz, left) and high frequencies 
(30-50 Hz, right). f, Distribution across neurons of mean V,, during stimulus 
(lavender) and post-stimulus (grey) periods, relative to mean pre-stimulus Vy. 
Light and dark circles in d and e indicate simple and complex cells, respectively. 


visual stimulation shifts the cortical network towards an asynchro- 
nous high-conductance state’. 

Visual stimulation also caused significant changes in the power of 
Vim fluctuations. Membrane potential showed greater power at low fre- 
quencies than at high frequencies during fixation, both before and 
during visual stimulation. Visual stimulation increased the power of 
Vim fluctuations from the trial average (that is, residuals) at high fre- 
quencies (30-50 Hz) but did not cause systematic changes at low fre- 
quencies (0.5-4 Hz) (Fig. 3a—c, e; Wilcoxon signed-rank test, P = 0.76 
(0.5-4 Hz), P = 0.001 (30-50 Hz); see also Supplementary Informa- 
tion and Extended Data Fig. 6). Post-stimulus V,, was typically below 
pre-stimulus values (Fig. 3f, Wilcoxon signed-rank test P< 0.0001). 

If, as our intracellular recordings suggest, visual stimulation shifts V1 
towards an asynchronous state, there should be a concomitant reduc- 
tion in the correlation between V,, and the surrounding network, as 
reflected in the simultaneously recorded nearby local field potential (LFP). 
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This was the case. During fixation with no visual stimulus, deflections 
indicating spontaneous increases in activity are evident in V,,, and LFP 
(Fig. 4a, left, depolarization for V,,, downward deflections for LFP). 
These deflections are coincident in both signals (asterisks in Fig. 4a); 
across our population, the zero-lag V,,—LFP cross-correlation was neg- 
ative during blank trials, reflecting coincident activation of the network 
and individual neurons (Fig. 4d, green, median cross-correlation —0.24; 
Wilcoxon signed-rank test, P< 0.01). To determine whether visual stimu- 
lation alters this relationship we examined V,,,-LFP correlations after trial 
averages had been subtracted (Fig. 4b, c, centre panels). Correlations 
declined when drifting gratings were presented (Fig. 4b-d; Wilcoxon 
signed-rank test, P< 0.01), such that the median cross-correlation was 
nearer zero (Fig. 4d, lavender; Wilcoxon signed-rank test, P = 0.91), 
providing further evidence that visual stimulation drives V1 towards 
an asynchronous state. The visually-evoked decline in V,,-LFP corre- 
lation was apparent for low frequency (0.5-4 Hz), but not high frequency 
fluctuations (Fig. 4e; Wilcoxon signed-rank test, P< 0.01 (0.5-4 Hz), 
P= 0.13 (30-50 Hz)); Vm-LFP coherence decreased at low (0.5-4 Hz), 
but not high frequencies (30-50 Hz) (Fig. 4b, c, right, and Fig. 4f; Wil- 
coxon signed-rank test, P< 0.05 (0.5-4 Hz), P = 0.34 (30-50 Hz); see 
also Supplementary Information and Extended Data Fig. 7). 

We have shown that in the absence of visual stimulation, V1 in alert 
behaving primates is not in an asynchronous high-conductance state’. 


a Fixation and b 
Fixation alone visual stimulation 
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Visual stimulation 
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Rather, spontaneous V,,, fluctuations are non-Gaussian and character- 
ized by occasional excursions from rest, consistent with synaptic input 
arising from infrequent correlated events”*. In our recordings, sensory 
stimulation drove V1 towards an asynchronous state, as visually evoked 
Vin was closer to spike threshold, showed more Gaussian fluctuations 
and became less correlated with low-frequency LFP. The visually evoked 
reduction in correlation between V,,, and LFP is consistent with previ- 
ously reported decreases in spiking correlations”***. In an analogous 
fashion, the correlated activity patterns observed in mouse sensory cortex"* 
during quiet wakefulness are disrupted by thalamic activation”. (See also 
Supplementary Information and Extended Data Fig. 8.) Our records 
focused on activity in superficial cortical layers; membrane potential 
characteristics may differ across layers, potentially reflecting laminar 
specificity in network state”®. 

How can cortical circuitry support synchronous and asynchronous 
states? One salient difference between the states was the amount of exter- 
nal input: without visual stimulation the thalamic drive to cortex is weak, 
whereas visual stimulation activates those afferents. We propose that 
this difference in afferent drive explains the shift in network state. Our 
proposal unifies observation and theory: a lower input spike rate reduces 
synaptic input so that V,, lies further from threshold; postsynaptic poten- 
tials due to different sources are less likely to overlap in time and instead 
appear as distinct events. Crucially, theory indicates that a low thalamic 
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Figure 4 | Magnitude of V,,,-LFP cross-correlation decreases during visual 
stimulation. a, Four trials of Vi, (black) with simultaneously recorded LFP 
(red) and eye position (thin black traces) during the blank (left) and the 
preferred orientation (right) from one neuron. b, ¢, Left: V,, and LFP from four 
trials with superimposed trial average. Centre: traces with trial averages 
subtracted. Pre., pre-stimulation; stim., stimulation; post., post-stimulation. 
Right: V,,-LFP cross-correlation functions and coherence magnitudes for 
blank (green) and stimulus (lavender) periods, and shuffled stimulus trials 
(grey). Shaded areas show bootstrapped s.e.m. for cross-correlation functions, 
and jack-knifed 95% confidence intervals for coherence magnitudes. Results in 
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Fixation coherence 


b and c are from different neurons. d, Zero-lag V,,—LFP cross-correlation 
during visual stimulation (lavender) versus blank trials (green) for each neuron, 
and corresponding marginal histograms. Medium and dark filled circles 
indicate cross-correlations that were respectively greater or less during visual 
stimulation than blank trials (Wilcoxon signed-rank test, P< 0.05). e, Zero-lag 
Vm-LFP cross-correlation at low frequencies (0.5-4 Hz, blue) and high 
frequencies (30-50 Hz, orange). f, Root-mean-square V,,—LFP coherence 
magnitudes at low frequencies (0.5-4 Hz, blue) and high frequencies 

(30-50 Hz, orange). 
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spike rate destabilizes the asynchronous state towards low-frequency 
correlations*”””*, but higher thalamic spike rates drive the network 
towards an asynchronous state in which correlations weaken*”””*, as 
observed in our data. It is clear that external drive alters the cortical 
state’, but internal factors are also essential. In extrastriate cortex, atten- 
tion causes an increase in overall response that is also accompanied by 
a decline in the correlation between neurons”. Explaining how these 
external and internal drives are synthesized will require understand- 
ing how V1 interacts with downstream areas. 


METHODS SUMMARY 


Macaque monkeys were trained to perform a visual fixation task (see also Sup- 
plementary Information and Extended Data Fig. 9), and implanted with recording 
chambers. Blind in vivo whole-cell recordings were performed’. Patch pipettes 
were filled with (in mM) 135 potassium gluconate, 4 NaCl, 0.5 EGTA, 2 MgATP” , 
10 phosphocreatine disodium, 10 HEPES, pH adjusted to 7.3 with KOH (Sigma- 
Aldrich) (see also Supplementary Information and Extended Data Fig. 10). 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
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METHODS 


All procedures were approved by the University of Texas Institutional Animal 
Care and Use Committee and conformed to National Institutes of Health stan- 
dards. Our general experimental procedures in behaving macaque monkeys have 
previously been described in detail**”. 

Behavioural task and visual stimulus. Three adult male macaque monkeys 
(Macaca mulatta) were trained to perform a visual fixation task in which gratings 
of different orientations were presented. Each trial began when a fixation spot was 
displayed at the centre of a monitor in front of the monkey. The monkey had to 
shift gaze to the fixation point and maintain fixation within a small window (less 
than 2° full width) for at least 1,500 ms to receive a reward. A drifting sinusoidal 
grating was presented at a randomized orientation for 1,000 ms while the monkey 
was maintaining strict fixation, thus minimizing variability due to eye movements. 
(See Supplementary Information and Extended Data Fig. 9 for characteristics of 
post-fixation saccades.) 

Visual stimuli were presented on a gamma-corrected high-end 21-inch colour 

display (Sony Trinitron GDM-F520) at a fixed mean luminance of 30cdm*. The 
display subtended 20.5° X 15.4° at a viewing distance of 108 cm and had a pixel 
resolution of 1,024 X 768, 30-bit colour depth and a refresh rate of 100 Hz. Visual 
stimuli were generated by using a high-end graphics card on a dedicated PC, using 
custom-designed software. Behavioural measurements and data acquisition were 
controlled by a PC running a software package for neurophysiological recordings 
from alert animals (Reflective Computing). Eye movements were measured with 
an infrared eye-tracking device (Dr Bouis). 
Whole cell recordings. Recording chambers were located on the dorsal portion of 
V1, with the anterior portion of the chamber reaching close to the lunate sulcus 
and the border between V1 and V2. We verified the retinotopic organization by 
voltage-sensitive dye imaging”, and by recording multiunit activity or local field 
potential with tungsten microelectrodes (Alpha Omega Co; MicroProbes for Life 
Sciences). The cortex in our cranial windows represents stimuli that are approxi- 
mately 2.5°-5° away from the fovea in the lower quadrant of the contralateral 
hemifield. 

Intracellular records of V,, (refs 18,34,35) were obtained with blind in vivo 
whole-cell recordings’? **. The recording chamber was filled with 2-4% agarose in 
artificial cerebrospinal fluid (CSF). Intracellular records were from neurons in the 
top 1,300 um of V1. Asa reference electrode, a silver-silver chloride wire was inserted 
into the agarose. The potential of the CSF was assumed to be uniform and equal to 
that of the reference electrode. Pipettes (6-12 MQ) were pulled from borosilicate 
glass capillaries (KG-33, 1.2 mm outer diameter, 0.70 mm inner diameter; King 
Precision Glass) on a P-2000 micropipette puller (Sutter Instruments). Patch pipettes 
were filled with (in mM) 135 potassium gluconate, 4 NaCl, 0.5 EGTA, 2 MgATP”, 
10 phosphocreatine disodium, 10 HEPES, pH adjusted to 7.3 with KOH (Sigma- 
Aldrich). Whole-cell current-clamp recordings were performed with an Axoclamp 
2B Microelectrode Amplifier (Molecular Devices). We subtracted 7 mV from all 
raw membrane potential values to compensate for the liquid junction potential**. 
(See Supplementary Information and Extended Data Fig. 10 for intrinsic proper- 
ties of recorded neurons.) 

Data analysis. We analysed V,, during the fixation period in trials during which 
the monkey performed the task successfully, provided that the mean V,, in the 
absence ofa visual stimulus was less than —50 mV. V,, was detrended by high-pass 
filtering at 0.1 Hz. Data were analysed with MATLAB (Mathworks). Shot noise 
contributions to V,, were assessed by the skewness°**”~” of V,,, distributions. Coher- 
ence estimates were performed with Chronux”’, a MATLAB library (freely available 
from http://chronux.org/). 

Data analysis for Supplementary Information. The relationship between spike 
rate R and V,, was described with a threshold followed by a power law*!*: R= 
A[Vin — Vin], » where A is a fitted constant, Vi; is the resting membrane potential, 
+ indicates rectification, and p is the fitted exponent. Orientation selectivity was 


assessed with an orientation selectivity index*** (vector average = 1 — circular 
variance). Temporal modulation was assessed with the Fourier component of the 
response with the same temporal frequency as the moving sinusoidal grating visual 
stimulus divided by the time-averaged response*® (F,/Fo). Simulations of Hodgkin- 
Huxley neurons used parameters adapted from refs 47 and 48, and were performed 
with Brian’. We estimated membrane conductance from voltage responses to 
hyperpolarizing current pulses of constant amplitude, and a fit of a sum of two expo- 
nentials to the voltage response”!: 


V(t) =Tinj[((Rm(1 — exp(—t/tm))) + (Re(1 — exp(— t/te)))| 


where V is the voltage response, ¢ is time, Tinj is injected current, Ry), is membrane 
resistance, T)4 is membrane time constant, Rp is electrode resistance and Tr is elec- 
trode time constant. Membrane conductance is 1/Ry,. 
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Extended Data Figure 1 | Orientation tuning of V,, and spike rate. a, Vin 
responses (top traces), eye position traces (bottom pairs of traces) from three 
blank trials (left), three trials at the preferred orientation (centre), and three 
trials at the orthogonal orientation (right). b, Trial averaged V,,, (top) and spike 
rate (bottom) for all orientations, from the neuron in a. c, Spike rate versus 
membrane potential, and best-fit thresholded power law, from the neuron in 
a. d, Orientation tuning curves for V,, and spike rate, and predicted spike rate 
orientation tuning curve using the V,, orientation tuning curve and the best-fit 
thresholded power law in c, from the neuron in a. e, Orientation selectivity 
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index (OSI) for spike rate versus OSI for Vin. Lines represent expected 
relationships between spike rate OSI and V,,, OSI for thresholded power laws 
with exponents 2, 3 and 5 (blue, red and black, respectively). f, Fourier 
component of the response with the same temporal frequency as the moving 
sinusoidal grating visual stimulus divided by the time-averaged response for 
spike rate (Rj/Ro) versus that for Vin (Vi/Vo). Lines represent expected 
relationships between R,/Ry and V,/Vo for thresholded power laws with 
exponents 2, 3 and 5 (blue, red and black, respectively). 
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Extended Data Figure 2 | Estimation and implication of V,, skewness 
during blank trials. a, Gaussian excitatory (black) and inhibitory (red) 
conductances, V,, with spiking disabled (green), V,,, with spiking enabled (light 
blue), and V,,, with spikes removed (dark blue), and corresponding Vin 
amplitude histograms and skewness values (, for a simulated neuron with 
Hodgkin—Huxley conductances. b, V,;, with spiking (light blue) and with spikes 


We ol 
-70 -60 -50 -40 -30 
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removed (dark blue) and corresponding V,,, amplitude histograms and 
skewness values C, for a recorded neuron. c, Apparent skewness from V,, with 
spikes removed versus skewness from V,, with spiking disabled from a 
simulated neuron with Hodgkin-Huxley conductances, for a range of Gaussian 
inputs. 
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Extended Data Figure 3 | Estimation of V,, skewness during visual histogram); residual traces from each cycle after subtraction of cycle-averaged 


stimulation trials. a, Raw traces from several trials. b, Traces after bandpass _ response (bottom grey traces), cycle-averaged residuals (bottom black trace) 
filtering and spike removal. c, V,, responses from each cycle (top grey traces), and histogram of V,, residuals (bottom histogram). Note the change in vertical 
cycle-averaged response (top black trace) and histogram of V,, responses (top __ scale from top to bottom panels. 
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Extended Data Figure 4 | Joint distribution of V,,-threshold distance and _ blank trials. b, Joint distribution of V,,-threshold distance and skewness ¢ 
skewness. a, Joint distribution of V,,-threshold distance and skewness (during _ during preferred orientation trials. 
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Extended Data Figure 5 | Membrane conductance during blank and visual | membrane conductance during visual stimulation in two example neurons 
stimulation trials. a, Distribution of membrane resistance (left) and during blank (left), preferred (centre) and 45° from preferred (right) trials. Each 
corresponding membrane conductance (right) during blank trials. b, Change in row shows data from a different neuron. 
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Extended Data Figure 6 | Power spectra of V,, and LFP fluctuations from 
the trial average. a, Power spectrum of V,,, (top panels) and LFP (bottom 
panels) fluctuations from the trial average (residuals) during blank trials 
(left panels), residuals during preferred orientation stimulation (middle 
panels), and raw V,, traces during preferred orientation stimulation (right 


panel). Each trace corresponds to an individual neuron. b, Population-averaged 
ratio of power spectrum at the preferred orientation to power spectrum for 
blank trials for V,, fluctuations from the trial average (‘V;, residuals’, left panel), 
LFP fluctuations from the trial average (“LFP residuals’, middle panel), and raw 
Vm (right panel). Error bars are jack-knifed standard errors. 
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during a flashed stimulus in a visual saccade task. a, Each trial began whena _ Asterisks indicate near-simultaneous deflections in V,, and LFP during the 
fixation spot was displayed at the centre of a monitor in front of the monkey. _ pre-stimulus fixation period. Grey shading indicates the analysis period for 


The monkey had to shift gaze to the fixation point and maintain tight correlations during the flashed Gabor stimulus; we included 30 ms after saccade 
fixation for at least 1,500 ms. A flashed Gabor target stimulus appeared at a onset in this period, because the visual latency for spike responses in the 
random time between 1,000 and 1,500 ms after the monkey had established lateral geniculate nucleus is greater than 30 ms. c, Zero-lag cross-correlation 
tight fixation. The monkey had to saccade to the target stimulus within 600 ms between V,, and LFP fluctuations from the trial average during the flashed 

to receive a reward. We analysed V,,, and LFP only from trials in which the Gabor stimulus versus during the pre-stimulus period. 


monkey performed the task successfully. b, Simultaneously recorded V,, and 
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Extended Data Figure 9 | Summary of first saccade latency and peak velocity 
in monkeys T and W, which together contributed the majority of the 
recorded data. a, Top: histogram of latency of first saccade after fixation point 
termination in three neurons (158 trials) in monkey W. Arrow indicates 
median latency (217 ms). In 1.9% of the trials no saccade was detected in 

the 600 ms after fixation point termination. Bottom: histogram of peak eye 
velocity for first saccades during the 600 ms after fixation point offset. Arrow 
indicates median peak velocity (292° s_'). b, Results from eight neurons 

(464 trials) in monkey T. The format is the same as in a. Median latency is 


LETTER 


100 y 


>600 


60! 12.5%| 


# of trials 


40; 


20; 


0 
0 200 600 


400 
Time of 1st saccade 


after fixation point off (ms) 


v 


40 | 


ie) 
[o) 


# of trials 
8 


— 
oO 


400 800 1000 


600 
Peak speed of 1st saccade (deg/sec) 


0 200 


314 ms and median peak velocity is 229° s_'. Monkey W tended to make larger 
saccades away from fixation, whereas monkey T tended to make smaller 
saccades and in a small subset of the trials remained close to the fixation point 
location until the next trial was initiated. This may reflect the fact that the 
minimal inter-trial interval was shorter in monkey T than in monkey W. 

The short latency of the saccades after fixation point termination in the vast 
majority of the trials indicates that both monkeys were alert and attentive and 
were actively engaged in maintaining tight fixation. 
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Extended Data Figure 10 | Regular-spiking neurons. a, V,,, response to interspike interval increased with interval ordinal, indicating that this neuron 
injections of current steps of different magnitudes in an example neuron. was regular-spiking. 
b, Interspike interval during the current step versus interval ordinal. The 


©2014 Macmillan Publishers Limited. All rights reserved 


1 sd Wal Be 


doi:10.1038/nature13168 


Listeria monocytogenes exploits efferocytosis to 
promote cell-to-cell spread 


Mark A. Czuczman!?, Ramzi Fattouh!, Jorik M. van Rijn’, Veronica Canadien', Suzanne Osbornel, Aleixo M. Muise 


Vijay K. Kuchroo’, Darren E. Higgins® & John H. Brumell'”*"° 


Efferocytosis, the process by which dying or dead cells are removed 
by phagocytosis, has an important role in development, tissue homeo- 
stasis and innate immunity’. Efferocytosis is mediated, in part, by 
receptors that bind to exofacial phosphatidylserine (PS) on cells or 
cellular debris after loss of plasma membrane asymmetry. Here we 
show that a bacterial pathogen, Listeria monocytogenes, can exploit 
efferocytosis to promote cell-to-cell spread during infection. These 
bacteria can escape the phagosome in host cells by using the pore- 
forming toxin listeriolysin O (LLO) and two phospholipase C enzymes”. 
Expression of the cell surface protein ActA allows L. monocytogenes 
to activate host actin regulatory factors and undergo actin-based motil- 
ity in the cytosol, eventually leading to formation of actin-rich pro- 
trusions at the cell surface. Here we show that protrusion formation 
is associated with plasma membrane damage due to LLO’s pore- 
forming activity. LLO also promotes the release of bacteria-containing 
protrusions from the host cell, generating membrane-derived vesi- 
cles with exofacial PS. The PS-binding receptor TIM-4 (encoded 
by the Timd4 gene) contributes to efficient cell-to-cell spread by 
L. monocytogenes in macrophages in vitro and growth of these bac- 
teria is impaired in Timd4 ’~ mice. Thus, L. monocytogenes pro- 
motes its dissemination in a host by exploiting efferocytosis. Our 
results indicate that PS-targeted therapeutics may be useful in the 
fight against infections by L. monocytogenes and other bacteria 
that use similar strategies of cell-to-cell spread during infection. 

The intermediate stages of cell-to-cell spread by L. monocytogenes 
remain unclear. On the basis of observations with an in vitro infection 
model, it has been suggested that bacteria-containing protrusions are 
released from infected cells before uptake of membrane vesicles contain- 
ing bacteria by neighbouring cells’. However, the mechanisms that medi- 
ate protrusion release and uptake of bacteria in vesicles are not known. 

LLO is required for L. monocytogenes cell-to-cell spread in some cell 
types, including macrophages*”. LLO is a pore-forming toxin that is 
often referred to as a ‘phagosome-specific lysin’® because it has limited 
activity in the cytosol of host cells, owing to its relatively low lytic 
activity’ and stability* at neutral pH. Furthermore, LLO is degraded by 
the proteasome’. Despite these factors, it is now appreciated that LLO 
can damage the plasma membrane of host cells'°. Host membrane repair 
pathways limit LLO-mediated membrane damage”’, but the mechan- 
isms by which they act remain unclear. LLO is essential for disruption 
of the outer membrane of spreading vacuoles*. Whether LLO contri- 
butes to other stages of cell-to-cell spread has not been tested. 

We proposed that LLO-mediated damage to the plasma membrane 
may promote cell-to-cell spread. We used a propidium iodide (PI) assay 
to measure membrane damage induced during L. monocytogenes infec- 
tion (Fig. 1a). Repair of the plasma membrane is a Ca**-dependent 
process!?. Therefore, the absence of Ca** in the medium provided a 
convenient method to inactivate endogenous repair mechanisms and 
visualize the full extent of membrane damage. HeLa cells were used for 


1,4,5,6 
> 


these studies as phagosome escape by L. monocytogenes does not require 
LLO in this cell type’’. 

In the absence of extracellular Ca?*, infection of cells with wild-type 
bacteria revealed an increase in membrane damage compared with un- 
infected cells (Fig. 1b, c). The number of PI* cells increased over time, 
indicating that membrane damage was an ongoing event during infec- 
tion. Less damage was observed when Ca** was present in the extracel- 
lular medium, suggesting that Ca**-dependent repair pathways limit 
plasma membrane damage. 

Caspase 7 promotes membrane repair during L. monocytogenes infec- 
tion of macrophages"'. Consistent with this, we found that short interfer- 
ing RNA (siRNA)-mediated knockdown of caspase 7 increased membrane 
damage induced by L. monocytogenes (Extended Data Fig. 1a, b). How- 
ever, this effect was minor, indicating that other factors contribute to 
membrane repair. Annexins also have a role in membrane repair. We 
found that siRNA-mediated knockdown of annexins Al, A2 and A6 
led to an increase in membrane damage (Extended Data Fig. la, b). We 
conclude that multiple host factors contribute to repair of the plasma 
membrane during L. monocytogenes infection. 

LLO damages host membranes during infection'®’’. Consistent with 
this, a mutant lacking LLO (Ahly) was impaired in membrane damage, 
and complementation with hly restored membrane damage (Fig. 1d 
and Extended Data Fig. 2a). Deletion of both phospholipase C (PLC) 
enzymes had no effect on membrane damage in Ca~” -free media. How- 
ever, PLCs were required for membrane damage in Ca” * -containing 
media, suggesting that they may promote LLO activity and/or inhibit 
membrane repair mechanisms. We observed a decrease in membrane 
damage in cells infected with ActA-deficient (AactA) bacteria. Treat- 
ment of cells with the actin inhibitors latrunculin B or cytochalasin D 
hada similar effect (Extended Data Fig. 2b). Thus, actin-based motility 
may promote membrane damage by LLO by allowing close apposition 
of bacteria to the plasma membrane. 

Membrane damage is associated with the loss of membrane asymmetry 
and redistribution of PS to the outer leaflet of the plasma membrane”. 
Therefore, to visualize membrane damage induced by L. monocyto- 
genes we stained cells with a probe (annexin A5-Alexa 488) to label exo- 
facial PS. In uninfected cells, low amounts of exofacial PS were detected 
(Extended Data Fig. 3). By contrast, treatment of cells with saponin led 
to staining of cells with annexin A5-Alexa 488. 

In cells infected by wild-type bacteria, we observed the formation of 
PS* structures at the cell surface (Fig. 2a, right, and Extended Data Fig. 4). 
These structures co-localized with bacteria (visualized by red fluores- 
cent protein (RFP) expression’®) and were not associated with cellular 
blebbing, rounding or nuclear condensation, indicating that they were 
not the consequence of apoptosis. Extracellular bacteria did not label 
with annexin A5-Alexa 488 and bacteria present in PS* structures were 
not accessible to anti-L. monocytogenes antibodies (Extended Data Fig. 5). 
This indicated that PS* bacteria were present in a host-derived membrane 
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Figure 1 | Actin-based motility promotes LLO-mediated membrane 
damage. a, Experimental design for membrane damage assay. b, Confocal 
images of HeLa cells infected as in a with wild-type L. monocytogenes at a 
multiplicity of infection (MOI) of 100. DAPI, 4’,6-diamidino-2-phenylindole. 
Scale bars, 10 jum. Images are representative of three independent experiments. 
c, Cells were infected as in a for the indicated time and PI* cells were 
enumerated (n = 100). Data are shown as averages + standard deviation (s.d.) 
for four independent experiments. P values were calculated using two-tailed 
Student’s t-test. d, Averages + s.d. for three independent experiments are 
shown. P values were calculated using one-way analysis of variance (ANOVA). 
*P < 0.05, **P< 0.01, ***P < 0.001. 


structure. Correlative microscopy revealed that the PS structures were 
associated with the surface of infected cells (Fig. 2a, left). Formation of 
PS* structures required LLO, consistent with a role for membrane damage 
in their formation (Fig. 2b, c). PS* structures were also observed in Henle- 
407 cells and murine bone-marrow-derived macrophages (BMDMs) 
(Extended Data Fig. 6). 

Several lines of evidence indicated that PS* structures are associated 
with protrusion formation. First, we observed filamentous PS* struc- 
tures connected to bacteria that resembled protrusions (Fig. 2b, top). 
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Figure 2 | Formation of PS* structures during L. monocytogenes infection. 
a, HeLa cells were infected at an MOI of 100 with wild-type L. monocytogenes 
expressing RFP (red) for 6h and then incubated with a probe for exofacial PS 
(annexin A5—Alexa 488; green). Cells were then fixed and analysed by 
fluorescence microscopy to identify PS* structures and L. monocytogenes 
(right). Scanning electron microscopy (SEM) of the same cell shows PS* 
structures (left). Images are representative of two independent experiments. 
Scale bars, 5 um. b, Confocal images of cells infected with the indicated strain 
for 6 h. Boxed areas (left) are low-magnification images (a three-colour merge, 
includes phalloidin stain for F-actin); they are enlarged on the right. Arrows 
indicate L. monocytogenes bacteria that co-localize with PS, and arrowhead 
indicates PS* filaments. Images are representative of three independent 
experiments. Scale bars, 10 1m for low-magnification merge, and 3 jm for 
enlarged panels. c, Cells containing PS* bacteria enumerated for b. Averages + 
s.d. for three independent experiments are shown (n = 100). P values were 
calculated using one-way ANOVA. **P < 0.01, ***P < 0.001. 
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Second, AactA mutants did not form these structures (Fig. 2b, c). Third, 
we observed that the kinetics of formation of PS* structures were similar 
to the kinetics of protrusion formation (marked by ezrin”; Fig. 3a). Live 
cell imaging revealed that bacteria-containing protrusions (marked by 
LifeAct-RFP) rapidly recruited the exofacial PS probe, before round- 
ing of the PS” protrusion into vesicles on the cell surface (Fig. 3b and 
Supplementary Video 1). Recruitment of the exofacial PS probe was 
limited to protrusions and vesicles, indicating that loss of membrane 
asymmetry was not the result of global cellular membrane damage. 
Annexin A2 was localized to PS* bacteria (Extended Data Fig. 1c), sug- 
gesting that membrane repair pathways serve to locally restrict mem- 
brane damage. 

Streptolysin O, a pore-forming toxin from group A Streptococcus, 
has been shown to induce membrane blebbing and release of plasma- 
membrane- derived vesicles'*". Our studies indicated that L. monocy- 
togenes similarly induces the release of membrane vesicles through LLO. 
PS* structures containing bacteria were observed on coverslips with no 
connections to infected cells (Extended Data Fig. 7a). We also isolated 
these vesicles by centrifugation of the medium (Extended Data Fig. 7b). 
Our findings suggest that bacteria-containing protrusions are released 
by infected host cells, giving rise to PS* vesicles in the medium and that, 
in some cases, these vesicles remain associated with the cell surface of 
infected cells. Deletion of both PLCs caused an increase in the number 
of PS* structures associated with the surface of infected cells (Fig. 2b, c). 
Thus, PLCs may promote bacterial escape from PS” structures after 
their release from infected cells, which would be consistent with the role 
of PLCs in escape from vacuoles in secondarily infected cells*. 

Efferocytosis receptors mediate uptake of PS* cells and cellular debris’. 
In macrophages, TIM-4 has a key role in efferocytosis**”’. Because mac- 
rophages are a major target of L. monocytogenes during systemic infection, 
we examined the role of TIM-4 in cell-to-cell spread by these bacteria 
using an infection focus assay. The foci of infection in BMDMs from 
Timd4 ‘~ mice contained fewer infected cells compared with control 
BMDMs from C57BL/6 mice (Fig. 4a, b). Using live cell imaging, we 
observed that infection foci were smaller in BMDM cultures from 
Timd4 ‘~ mice from 8 to 18 h post-infection (Fig. 4c, d). Bacterial repli- 
cation within primarily infected Timd4 ‘~ BMDMsat low cell density 
(minimizing cell-to-cell spread) was comparable with that of control 
BMDMs (Extended Data Fig. 8). This indicated that deficient bacterial 
spread in Timd4 ‘~ BMDM cultures was not due to impaired bacterial 
replication in the cytosol of these cells. Blocking antibodies that targeted 
either TIM-4 or PS impaired bacterial spread in cultures of control but 
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Figure 3 | Protrusions give rise to PS* vesicles containing L. 
monocytogenes. a, PS" bacteria and ezrin* protrusions were enumerated in 
HeLa cells infected at an MOI of 100 with wild-type L. monocytogenes for 
the indicated time. Averages + s.d. for three independent experiments are 
shown (n = 100). b, Cells were transfected with LifeAct-RFP (F-actin probe) 
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not Timd4 ’~ BMDMs (Fig. 4b), indicating that TIM-4 promotes bac- 
terial cell-to-cell spread through its ability to bind PS” structures. 

TIM-4 was previously shown to suppress pro-inflammatory cyto- 
kine production in vivo”. Therefore, we considered the possibility that 
TIM-4 deficiency impairs cell-to-cell spread by L. monocytogenes indi- 
rectly, through enhanced pro-inflammatory cytokine production. How- 
ever, in our in vitro experiments, pro-inflammatory cytokine production 
was typically not observed until 18 h post-infection (after bacterial spread 
is initiated) and shared the same profile of expression in both control 
and Timd4 ’~ BMDM cultures (Extended Data Fig. 9a). This suggested 
that TIM-4 acts in a cell-autonomous manner to promote cell-to-cell 
spread by L. monocytogenes. 

To test this hypothesis, we used a cell-to-cell spread assay that mea- 
sures the direct transmission of bacteria from primary infected cells to 
secondary cells (labelled with CellTrackerBlue). Primary cells were infected 
with a bacterial mutant lacking both PLCs (AplcAAplicB), as this pre- 
vents bacterial escape from spreading vacuoles and avoids the confound- 
ing effect of rapid replication in the cytosol of secondary cells. Bacterial 
spread to secondary BMDMs from Timd4 ‘~ mice was reduced com- 
pared with control mice (Fig. 4e). Microscopic analysis revealed higher 
numbers of actin” protrusions associated with secondary BMDMs from 
control mice compared with Timd4~’~ mice (Fig. 4f, g). This suggests 
that TIM-4 may promote the association and/or stability of protru- 
sions as they project from infected cells into adjacent cells. Together, 
our findings indicate a specific role for TIM-4 in mediating cell-to-cell 
spread by L. monocytogenes. 

We observed decreased numbers of bacteria in the liver and spleen 
of Timd4-‘~ mice compared with control mice (Fig. 4h). Similar num- 
bers of AactA mutant bacteria were observed in the liver of control and 
Timd4 ’~ mice (Fig. 4i), indicating that TIM-4 is required for patho- 
genesis of motile bacteria in this organ and consistent with its role in 
promoting bacterial cell-to-cell spread. However, we observed a decrease 
in survival of AactA mutant bacteria in spleens of infected Timd4 ‘~ 
mice. We also observed elevated basal levels of pro-inflammatory cyto- 
kines in the spleens (and to a lesser extent the livers) of Timd4 ‘~ mice 
before infection (Extended Data Fig. 9b). Together, our findings sug- 
gest that L. monocytogenes exploits TIM-4 during systemic infection of 
mice by two mechanisms; directly, through uptake of PS* bacteria, and 
indirectly, through the ability of TIM-4 to suppress pro-inflammatory 
cytokine production. 

It has previously been demonstrated that TIM-4 mediates the clear- 
ance of apoptotic macrophages bearing Mycobacterium tuberculosis”. 


and then infected with wild-type L. monocytogenes expressing GFP for 6 h. Live 
infected cells were analysed by spinning-disk confocal microscopy with 
annexin A5—Alexa 647 in the medium to label exofacial PS (green). Successive 
frames are shown. Arrows indicates protrusion that acquires PS. Images are 
representative of four independent experiments. Scale bar, 10 jum. 
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This finding indicates an important role for efferocytosis in innate 
immunity. By contrast, our findings suggest that efferocytosis can be 
exploited by L. monocytogenes to promote cell-to-cell spread. Other mech- 
anisms also contribute to L. monocytogenes dissemination in a host”*”**. 
Viruses are known to exploit efferocytosis by incorporating PS into their 
viral membrane”””’. L. monocytogenes does not incorporate PS into its 
cellular wall but rather exploits host membrane repair mechanisms 
to promote release of protrusion-associated membrane, giving rise to a 
PS* ‘cloak’ that can promote efferocytosis (see model in Extended 
Data Fig. 10). Similar forms of non-lytic ejection of bacteria from host 
cells have been described for Mycobacterium marinum” and Chlamydia 
trachomatis”, indicating that these (and probably other) pathogens 
may also exploit efferocytosis during infection. In summary, our study 
identifies a novel strategy used by pathogenic bacteria to promote their 
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Figure 4 | TIM-4 promotes L. monocytogenes cell-to-cell spread in 
macrophages and growth in mice. a, Infection focus assay for measuring 
cell-to-cell spread. BMDMs were infected with L. monocytogenes at an MOI of 
0.01. Images are representative of three independent experiments. Dotted lines 
delineate infection foci. Scale bars, 100 tm. b, Infection foci from a were 
enumerated by fluorescence microscopy. A total of 100 infection foci were 
analysed for each genotype/strain. Where indicated, cells were treated with 
anti-PS, anti-TIM-4 or non-specific rat IgG (control) during infection. 

WT, wild type. Averages + s.d. for three independent experiments are shown. 
P values were calculated using two-way ANOVA with Bonferroni post-test. 

c, Monolayers of control or Timd4-‘~ BMDMs were infected at an MOI of 0.01 
with L. monocytogenes expressing RFP and examined by live cell imaging. 
Images are representative of two independent experiments. Dotted lines 
delineate infection foci. Scale bars, 100 um. d, The number of infected cells per 
focus from c. Averages + s.d. for two independent experiments are shown 

(n = 10). P values were calculated using two-tailed Student’s t-test. e, Cell-to- 
cell spread assay. Averages + s.d. for three independent experiments are shown 
(n = 100). P values were calculated using two-tailed Student’s t-test. f, Images 
from e. Arrows indicate actin® protrusions associated with secondary cells. 
Images are representative of four independent experiments (n = 100). Scale 
bars, 10 um. g, Actin protrusions associated with secondary cells were 
enumerated. Averages + s.d. for four independent experiments are shown. 

h, i, Mice were infected with 5 X 10* colony-forming units (c.f.u.) wild-type 
(h) or 1 X 10’ c.fu. AactA mutant (i) L. monocytogenes by tail vein injection. 
Mice were killed and livers and spleen were harvested at 72 h after infection for 
quantification of bacterial load (c.f.u. per gram tissue). Data are expressed as 
mean + standard error of the mean. P values were calculated using one-tailed 
Mann-Whitney test. 


dissemination in a host and provides a remarkable example of how 
pathogens can exploit innate immune defences during infection. 


METHODS SUMMARY 


All L. monocytogenes strains were derivatives of 10403S and are listed in Methods. 
Six-to-eight-week-old C57BL/6 and Timd4 ’~ mice were used to generate BMDMs 
or were infected intravenously for systemic infection studies. Cytokine levels were 
measured using a mouse 7-plex pro-inflammatory kit. In vitro infections of con- 
fluent cell monolayers were performed at an MOI of 0.1-100, as indicated in figure 
legends, and were performed in the presence of gentamicin to prevent extracellular 
growth. Constructs, antibodies, siRNA and other pharmacological reagents are 
listed in Methods. PI staining was carried out 6 h post-infection in Tyrode’s buffer 
in the presence or absence of calcium. Annexin A5-Alexa 488 was added to chilled 
cells for exofacial PS staining. Staining protocols are indicated in Methods. For the 
infection focus assay, confluent monolayers of BMDMs were infected for 18 h and 
the size of infection foci was analysed. For the direct cell-to-cell spread assay, primary 
cells were lifted with CellStripper and overlaid onto fresh cultures of CellTrackerBlue- 
labelled cells for 90 min. 

Unless indicated, cells were fixed with paraformaldehyde. Quantifications were 
done on a Leica DMIRE2 epifluorescence microscope with X63 objective. Corre- 
lative electron microscopy images were acquired from dehydrated samples gold 
coated in a Denton Desk II sputter coater and examined under a FE] XL30 SEM. 
Confocal z stacks (2 um) were taken with a Quorum spinning disk microscope, 
analysed with Volocity 6 software and imported into Adobe Illustrator. Nikon Ti- 
E microscopy was used to acquire images of infection foci at X10 magnification. 

The mean plus s.d. is shown in figures. P values were calculated using GraphPad 
Prism v.4.0a using statistical tests indicated in figure legends. A P value less than 
0.05 was considered significant. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 

Bacterial strains. L. monocytogenes were grown in brain-heart infusion (BHI) broth 
and the following strains were used: 10403S (wild type)*', DP-L2161 (Ahly), 
DP-L4818 (Ahly + hly)*?, DP-L3078 (AactA)**, and DP-L1936 (AplcA AplcB)*. 
Wild-type L. monocytogenes 10403S expressing TagRFP under the actA promoter 
(DP-L5538)’* or expressing GFP under the hly promoter (DP-L1039) were previ- 
ously described**. 

Antibodies, constructs and reagents. Primary antibodies used were rabbit anti- 
annexin A2 (#610068 from BD Biosciences), rabbit anti-GFP (#A11120 from Invi- 
trogen), rabbit anti-caspase 7 (#12827 from Cell Signaling), rabbit anti-Listeria 
(#B223021 from BD Biosciences), rabbit anti-L. monocytogenes antibody (gift from 
P. Cossart), mouse anti-phosphatidylserine (#18005 from Abcam) and mouse anti- 
ezrin (#35-7300 from Invitrogen). Rat anti-TIM-4 blocking antibody was previ- 
ously described*’. Alexa Fluor-568 Phalloidin, annexin A5-Alexa Fluor-488 and -647 
conjugates and all fluorescent secondary antibodies (AlexaFluor conjugates) were 
from Invitrogen. DAPI (#D1306 from Invitrogen) was used at 1:2,500 dilution to 
stain the nuclei where indicated. Cytochalasin D (#2502555; 101M final) and 
Latrunculin B (#428020; 10 pg ml~ | final) were from Calbiochem. For transfection 
of HeLa cells, Xtreme Gene 9 (Roche) transfection reagent was used as per manu- 
facturer’s protocols. LifeAct-mRFP* was a gift from R. Truant. The following siRNAs 
were from Sigma: annexin A1 (#00157996), annexin A2 (#00246294), annexin A6 
(#00063383), caspase 7 (#00128361). 

Cell culture and macrophage generation. HeLa and Henle-407 cells were cul- 
tured in DMEM (Hyclone) supplemented with 10% heat-inactivated FBS (Wisent) 
without antibiotics at 37 °C and 5% CO . 

All experimental protocols involving mice were approved by the Animal Care 
Committee of The Hospital for Sick Children. Mice were euthanized by CO, 
inhalation. Mouse BMDMs were obtained from the dissected femurs and tibias of 
5-8-week-old mice. Cells were washed with growth medium and plated on 70 cm* 
Petri dishes. Medium was replaced every 2 days and after 7-9 days cells were used for 
experiments. Cells were maintained in high-glucose RPMI-1640 medium (Wisent 
#350-025-CL) containing 10% heat-inactivated FBS (Wisent), 1% sodium pyruvate 
(Wisent), 1% non-essential amino acids (Wisent), 0.5% 2-mercaptoethanol (Gibco), 
1% penicillin and streptomycin (Invitrogen), and 10% L929 medium. L929 medium 
was generated by growing a confluent layer of L929 cells in 175 cm” flasks in DMEM 
supplemented with 10% heat-inactivated FBS. When cells reached confluency, growth 
medium was replaced by DMEM alone. After 7 days, supernatant was collected, 
centrifuged, filtered, and stored at —20 °C. 

PI assay for membrane damage. HeLa cells were plated at 5 X 10* cells per well in 
24-well tissue culture plates ~18h before infection. For siRNA treatments, the 
cells were plated at 2.5 X 10* cells per well. siRNAs were applied after 18 h and the 
media was changed 24h later. Infections were carried out 24h later. All strains of 
L. monocytogenes were infected at an MOI of 100. Bacteria were spun onto cells by 
centrifugation at 1,500 r.p.m. for 5 min. After 60 min of invasion at 37 °C, cells were 
washed three times with PBS with calcium and magnesium (PBS+, Wisent #311- 
420-CL) followed by the addition of growth medium containing 50 pg ml’ genta- 
micin (Wisent #400-130-IG). At 6h post-infection, cells were washed two times 
with PBS without calcium and magnesium (PBS—, Wisent #311-010-CL) and replaced 
with Tyrode’s Buffer (10 mM HEPES, 10 mM glucose, 5 mM potassium chloride, 
140 mM sodium chloride, 1 mM EGTA, 1 mM magnesium chloride, 2 mM calcium 
chloride, pH 7.4) containing 0.5 mg ml PI (Sigma #P4170). For Tyrode’s Buffer 
lacking calcium the 2 mM calcium chloride was replaced with 2mM magnesium 
chloride. Cells were incubated in Tyrode’s Buffer with PI for the indicated times and 
then fixed with 4% PFA (EM Sciences #15710). Samples were mounted on slides 
using fluorescence mounting medium (Dako). Quantifications were done using a 
Leica DMIRE2 epifluorescence microscope equipped with a X40 objective. 
Annexin A5-Alexa 488 staining of exofacial PS. HeLa cells were plated at 5 X 10* 
cells per well in 24-well tissue culture plates ~18 h before infection. All strains of 
L. monocytogenes were infected at an MOI of 100 in DMEM. Bacteria were spun 
onto cells by centrifugation at 1,500 r.p.m. for 5 min. After 60 min of invasion at 
37 °C, cells were washed three times with PBS+ followed by the addition of growth 
media containing 50 pg ml‘ gentamicin (Wisent #400-130-IG). After indicated 
times of infection, cells were cooled on ice and washed twice with chilled PBS+. 
Annexin A5-Alexa Fluor-488 conjugate (Invitrogen) was diluted to 1% (v/v) in chilled 
PBS with calcium and magnesium and added onto the coverslips for 10 min on ice. 
Cells were washed twice with chilled PBS+ and fixed with 2.5% paraformaldehyde 
(PFA) (EM Sciences #15710) for 30 min at 37 °C. Samples were mounted on slides 
using fluorescence mounting medium (Dako). 

Immunofluorescence. Immunostaining was conducted as previously described”. 
Briefly, cells were permeabilized and blocked in PBS containing 0.2% saponin (Cal- 
biochem) and 10% normal goat serum (SS-PBS) for 30 min. Subsequently cells 
were incubated for 1 h with primary antibodies in SS-PBS. Cells were washed three 
times with PBS and incubated with secondary Alexa Fluor conjugated antibodies 
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for 1h. Cells were washed three times with PBS, mounted in fluorescence mount- 
ing medium (Dako), and analysed using a Leica DMIRE2 epifluorescence micro- 
scope. Confocal pictures were taken using a Quorum spinning disk microscope 
(Leica DMIRE2 inverted fluorescence microscope equipped with a Hamamatsu 
back-thinned electron multiplying charge-coupled device camera, spinning disc 
head, and Volocity 4 software (Improvision)). Confocal images were imported into 
Adobe Photoshop and assembled in Adobe Illustrator for labelling. 

Correlative light-scanning electron microscopy. HeLa cells were plated at 5 X 10° 
cells per well in 6-well tissue culture plates with etched grid coverslips with imprinted 
numbers (BELLCO Biotechnology) 24h before infection. Cells were infected with 
wild-type L. monocytogenes expressing RFP (DP-L5538) at an MOI of 100 in DMEM. 
Bacteria were spun onto cells by centrifugation at 1,500 r.p.m. for 5 min. After 60 min 
of invasion at 37 °C, cells were washed three times with PBS+ followed by the addi- 
tion of growth media containing 50 jg ml! gentamicin (Wisent #400-130-IG). At 
6h post-infection, cells were cooled on ice and washed twice with chilled PBS+. 
Annexin A5-Alexa Fluor 488 conjugate (Invitrogen) was diluted to 1% (v/v) in 
chilled PBS with calcium and magnesium and added onto the coverslips for 10 min 
on ice. Cells were washed twice with chilled PBS+ and fixed with 2.5% PFA (EM 
Sciences #15710) for 30 min at 37 °C. Coverslips were imaged by fluorescence micros- 
copy in PBS+. Subsequently, samples were fixed in 2% glutaraldehyde in cacodylate 
buffer, rinsed in buffer and dehydrated in a graded ethanol series. The samples were 
critical point dried in a Bal-tec CPD030 critical point dryer, mounted on alumi- 
nium stubs, gold coated in a Denton Desk II sputter coater and examined in an FEI 
XL30 SEM. 

Live cell imaging of PS* structure formation. Hela cells were plated at 2 x 10° 
cells per well in 6-well tissue culture plates with glass coverslips 48 h before infec- 
tion. At 24h before infection, cells were transfected with LifeAct-RFP. Cells were 
then infected with wild-type L. monocytogenes expressing GFP (DP-L1039) at an 
MOI of 100 in DMEM. After 60 min of invasion at 37 °C, cells were washed three 
times with PBS+ followed by the addition of growth media containing 50 pg ml! 
gentamicin (Wisent #400-130-IG). At 6h post-infection, coverslips were washed 
with PBS+ and transferred into microscope chambers and RPMI-1640 with 10% 
heat-inactivated FBS (Wisent), 50 pg ml! gentamicin, 2.5 mM CaCl, and 2% (v/v) 
annexin A5—Alexa Fluor 647 conjugate (Invitrogen). HeLa cells were maintained at 
37 °C during imaging. A Leica DMIRE2 inverted fluorescence microscope equipped 
with a Hamamatsu Back-Thinned EM-CCD camera and spinning disk confocal 
scan head with a X63 objective and LSM 510 software was used. Volocity software 
(Improvision) was used to acquire images. 

Animals. Timd4 /~ mice (on a C57BL/6 background) were previously character- 
ized” and bred in house at the Hospital for Sick Children Animal Care Facility. 
C57BL/6 mice, originally from The Jackson Laboratory, were also bred in house 
and used as controls. All experiments were performed with 7-9-week-old female 
mice that were maintained on a 12h light-dark cycle, with food and water avail- 
able ad libitum. Sample size was chosen based on litter size and where possible five 
mice were used per experiment per group and per bacterial strain. Experiments 
were not blinded and mice were not randomized. All experiments described in this 
study were carried out in accordance with the Guide for the Humane Use and Care 
of Laboratory Animals and were approved by the Hospital for Sick Children’s Animal 
Care Committee. 

Infection focus assay. After 7-10 days of differentiation, BMDMs were washed 
twice and detached with ice-cold Versene buffer (0.8 mM EDTA, 1 mM glucose in 
PBS—) for 20 min at 4°C. BMDMs were then seeded onto coverslips in 24-well 
tissue culture plates at 8.0 X 10° cells per well to generate a monolayer. After 18h, 
the monolayer was infected with wild-type L. monocytogenes at an MOI of 0.01 in 
RPMI-1640. At 30 min post-infection, cells were washed three times with PBS and 
cultured in RPMI-1640 medium containing 10% FBS. At 60 min post-infection, 
cells were washed three times with PBS and RPMI-1640 containing 10% FBS, 10% 
L929 conditioned medium, and 50 1g ml! gentamycin was added to the cultures. 
At 18h post-infection, cells were fixed with 2.5% PFA for 30 min at 37°C and 
prepared for fluorescence microscopy. The number of infected cells per focus of 
infection was quantified using epifluorescence microscopy. Images for analysis 
were taken with a Hamamatsu Orca R2 camera and Nikon Ti-E microscope with 
X10 objective and Volocity software (Improvision). 

Live cell imaging of L. monocytogenes cell-to-cell spread. BMDMs were plated 
asa monolayer at 4 X 10° cells per well onto coverslips in a 6-well tissue culture plate. 
After 24 h, the monolayer was infected with L. monocytogenes expressing RFP (DP- 
L5538) at an MOI of 0.01 for 1h. Cells were washed three times with PBS and 
incubated at 37 °C in RPMI containing 10% FBS, 10% L-929 conditioned medium 
and 50 1g ml”! gentamycin. At 6h post-infection, the coverslips were imaged in 
25mM HEPES buffered RPMI containing 10% FBS and 50g ml * gentamycin 
using a Quorum spinning disk confocal microscope (Leica DMIRE2 inverted 
fluorescence microscope equipped with a Hamamatsu back-thinned electron mul- 
tiplying charge-coupled device camera, Yokogawa spinning disc head, and Volocity 
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6 software). Coverslips with BMDMs from control and Timd4 /~ mice were placed 
side by side on a dual-chamber heated stage at 37 °C. Over the course of 12 h, 36 um 
z stacks with a 2 um step were taken every 15 min at ten foci of infection per 
coverslip. The channels for DIC and red fluorescence were acquired throughout 
the experiment. Image analysis was performed to measure the number of infected 
cells per infection focus during the course of the experiment. 

Cell-to-cell spread assay. BMDMs from C57BL/6 mice were seeded at a density of 
1 X 10° in 10. cm? Petri dishes 16h before experiments. Cultures of AplcA AplcB 
mutant bacteria were used to infect BMDMs at an MOI of 10. At 60 min post- 
infection, cells were washed three times with PBS and RPMI medium containing 
10% FBS and 50 jg ml”! gentamycin was added to the cultures. At 3 h post-infection, 
cells were detached using CellStripper (Mediatech) containing 50 pg ml * genta- 
mycin and counted. 1 X 10° infected cells were then overlaid onto fresh cultures of 
CellTracker Blue-labelled BMDMs from C57BL/6 or Timd4 ’~ mice. At 90 min 
after overlay, cells were fixed with 2.5% PFA for 30 min at 37 °C and prepared for 
fluorescence microscopy. The uptake of bacteria from infected primary cells (unla- 
belled) to secondary cells (CellTracker Blue labelled) was determined by micro- 
scopic analysis. 

Labelling of secondary host cells with CellTracker Blue (Invitrogen) was performed 
as follows. Secondary cells were plated onto coverslips in 24-well tissue culture 
plates at 1.5 X 10° cells per well. At 2h before overlay, serum-free RPMI medium 
containing 20 1M CellTracker Blue was added to the culture. After 30 min incuba- 
tion, cells were washed three times with PBS and RPMI medium containing 10% 
FBS was added. 

Macrophage replication assay. BMDMs were plated at 3 X 10° cells per well in 
24-well tissue culture plates 24h before infection. Cells were then infected with 
L. monocytogenes at an MOI of 1 in RPMI-1640. After 30 min of invasion at 37 °C, 
cells were washed three times with PBS followed by the addition of RPMI-1640. At 
60 min post-infection, media was changed, and RPMI-1640 with 10% FBS contain- 
ing 50 pg ml ~ : gentamicin and 10% L929 was added. Cells were then lysed at 2, 4, 6, 
8, 10 and 12h post-infection with 1% Triton X-100 in PBS. Serial dilutions of the 
lysates were plated on BHI-agar plates and incubated at 37°C for 16h for sub- 
sequent quantification of intracellular c.f.u. 

Mouse infection and tissue collection. Mice were infected with 5 X 10*c.f.u. 
wild-type or 1 X 10’ c.f£.u. AactA mutant L. monocytogenes (in 200 il of PBS) via 


intravenous injection in the lateral tail vein. Liver and spleen were collected at 72h 
post-infection. Organs were placed in 1 ml of sterile PBS and homogenized. A 
dilution series was plated on BHI plates, grown overnight at 37 °C and individual 
bacterial colonies were enumerated. 

For cytokine analysis, cell culture supernatants and tissue homogenates were cen- 
trifuged (12,000 r.p.m. for 12 min) and supernatants were collected and stored at 
—80 °C. Cytokines were measured using the mouse 7-plex pro-inflammatory kit 
from Meso Scale Discovery, according to the manufacturer’s instructions. 
Statistical analysis. Statistical analyses were conducted using GraphPad Prism v.4.0a. 
The average + s.d. is shown in figures, and P values were calculated as described in 
figure legends. A P value of less than 0.05 was considered statistically significant. 
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Extended Data Figure 1 | Annexins promote membrane repair during 
L. monocytogenes infection. a, HeLa cells were treated with the indicated 


siRNA for 48 h and then infected with wild-type L. monocytogenes at an MOI of 
100. At 6h post-infection, medium was switched to Tyrodes buffer containing 


0.5mgml ’ PI with or without calcium. Cells were fixed at 60 min after PI 


addition and then stained for bacteria and DNA (DAPI). The percentage of 100 


random infected cells that were PI* cells were enumerated by microscopic 
analysis. Averages + s.d. for three independent experiments are shown. 
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P values were calculated using one-way ANOVA. *P < 0.05. b, Knockdown of 
gene expression by siRNA was confirmed by western blotting. Images are 


representative of two independent experiments. c, Recruitment of annexin A2 
to PS" structures containing bacteria. Boxes in low-magnification image 


indicate areas enlarged in bottom panels. Arrows indicate PS* structures that 


co-localize with annexin A2. Images are representative of three independent 


experiments. Scale bars, 10 jm for low-magnification images, 2 1m for 


enlarged regions of interest. 
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Extended Data Figure 2 | Actin-based motility promotes LLO-mediated 
membrane damage during L. monocytogenes infection. a, HeLa cells were 
infected with the indicated L. monocytogenes strain. At 6h post-infection, 
medium was switched to Tyrodes buffer containing 0.5mg ml’ PI with or 
without calcium. Cells were fixed at 60 min after PI addition and then stained 
for bacteria and DNA (DAPI). Confocal images representative of three 
independent experiments are shown (n = 100). PI* cells were enumerated and 


Ahly + hly 0.2 % Saponin 


results are shown in Fig. 1d. Where indicated, uninfected cells were treated with 
saponin to permeabilize membranes and allow PI entry, serving as a positive 
control. Scale bars, 10 um. b, HeLa cells were infected with wild-type L. 
monocytogenes and subjected to membrane damage assay as in a in the presence 
of either DMSO or the actin cytoskeleton inhibitors latrunculin B or 
cytochalasin D. Averages + s.d. for three independent experiments are shown 
(n = 100). P values were calculated using two-tailed Student’s t-test. *P < 0.05. 
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Extended Data Figure 3 | Annexin A5-Alexa 488 asa probe to label PS. Live cells, due to asymmetry of PS distribution in the plasma membrane. By 


HeLa cells were cooled on ice and stained with a fluorescent probe (annexin contrast, treatment of cells with the pore-forming surfactant saponin led to 
A5—Alexa 488) for 10 min to label exofacial PS. Cells were then fixed and robust staining of cells with annexin A5-Alexa 488. Images representative of 
stained with phalloidin Alexa 568 to visualize F-actin. In uninfected control three independent experiments. Scale bars, 10 um. 


experiments, low amounts of exofacial PS was detected in the membranes of 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


DIC RFP- Listeria Exofacial PS Merge SEM 


Extended Data Figure 4 | Formation of PS* structures during identify PS* structures and bacteria. SEM of the same cell revealed that PS* 
L. monocytogenes infection. Low-magnification images used to generate structures were associated with the dorsal surface of infected cells. Differential 
images shown in Fig. 2a. HeLa cells were infected with wild-type interference contrast (DIC) microscopy of cells was also performed to help 


L. monocytogenes expressing RFP for 6h and then cooled on ice and stained _ identify cells for correlative imaging analysis. Images are representative of two 
with a fluorescent probe (annexin A5-Alexa 488) for 10 min to label exofacial independent experiments. Scale bars, 20 um. 
PS. Cells were then fixed and analysed by fluorescence microscopy to 
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Extended Data Figure 5 | PS* bacteria are present with a host-derived 
membrane structure. a, HeLa cells were infected with wild-type 

L. monocytogenes expressing RFP for 8 h and then labelled with a probe for 
exofacial PS (annexin A5-Alexa 488). Cells were then rapidly stained with 
anti-Listeria antibodies (5 min) to label extracellular bacteria. Cells were then 
fixed and analysed by fluorescence microscopy to identify PS* structures and 
bacteria. Bacteria that co-localize with exofacial PS but are not labelled with 
anti-Listeria antibodies in the extracellular medium are indicated with arrows. 
Extracellular bacteria do not label with annexin A5-Alexa 488, indicating that 
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this probe does not bind non-specifically to bacteria. Box in low-magnification 
image indicates area enlarged in bottom panels. Images are representative of 
three independent experiments. Scale bars, 10 jum for low magnification, 2 1m 
for high magnification. b, Cells were infected and stained as in a and analysed 
by fluorescence microscopy. Bacteria that co-localize with exofacial PS were 
scored for their accessibility to anti-Listeria antibodies present in the 
extracellular medium. Data show that the majority of PS* bacteria are not 
accessible to anti-Listeria antibodies. Averages + s.d. for two independent 
experiments are shown (n = 100). 
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Extended Data Figure 6 | Formation of PS* structures during 


phalloidin to visualize F-actin (red) and bacteria (blue). Cells were analysed by 
L. monocytogenes infection of epithelial cells and macrophages. 


fluorescence microscopy to identify PS* structures and bacteria. Images are 
a, Henle-407 human intestinal epithelial cells were infected with wild-type representative of three independent experiments. b, Mouse BMDMs from 

L. monocytogenes for 6h and then incubated with a probe for exofacial PS C57BL/6 mice were infected and stained as in a. Scale bars, 10 jum. Images are 
(annexin A5-Alexa 488; green). Cells were then fixed and stained with representative of three independent experiments. 
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Extended Data Figure 7 | Release of PS* structures containing mutant bacteria for 6h. The supernatant from the infected cultures was then 
L. monocytogenes from infected cells. a, HeLa cells were infected with removed and centrifuged onto poly-L-lysine-coated coverslips. Bacteria 
wild-type L. monocytogenes for 6h and then incubated with a probe for associated with coverslips were then stained with a probe for exofacial PS 
exofacial PS (annexin A5—Alexa 488; green). Cells were then fixed and stained (annexin A5-Alexa 488; green). Cells were then fixed and stained for bacteria 
with phalloidin to visualize F-actin (red) and bacteria (blue). Cells were (blue). Coverslips were analysed by fluorescence microscopy to identify ps* 
analysed by fluorescence microscopy to identify PS* structures and bacteria. _ bacteria. Inset shows PS” bacteria. Scale bars, 10 jim for low-magnification 


Inset shows PS* bacteria that are not cell associated. Images are representative images, 2 jm for insets. Images are representative of three independent 
of three independent experiments. b, HeLa cells were infected with AplcAAplcB —_ experiments. 
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Extended Data Figure 8 | Growth of L. monocytogenes in the cytosol of 
Timd4~’~ macrophages is not impaired. Gentamicin protection assay to 
measure intracellular bacterial growth. BMDMs were harvested from C57BL/6 
or Timd4 ’~ mice and seeded at a density of 3 X 10° cells per well. Cells 
were then infected with wild-type L. monocytogenes in the presence of 
extracellular gentamicin. At the indicated times, cell lysates were plated and 
intracellular bacterial numbers (c.f.u.) were determined. Averages + s.d. for 
two independent experiments are shown. 
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Extended Data Figure 9 | Cytokine measurements. a, Measurement of tissues of C57BL/6 or Timd4‘~ mice without infection. Averages + s.d. for 
cytokines after in vitro infection of BMDMs from C57BL/6 or Timd4'~ mice _ three independent experiments are shown. P values were calculated using 
with wild-type L. monocytogenes for the indicated time. Data from one of two one-way ANOVA. 

independent experiments are shown. b, Measurement of basal cytokines in 
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Extended Data Figure 10 | L. monocytogenes exploits efferocytosis to 
promote cell-to-cell spread during infection. Model shows the steps that 
promote cell-to-cell spread by L. monocytogenes. 1. Protrusion formation 

via actin-based motility. 2. LLO-mediated damage to the plasma membrane 
leads to loss of membrane asymmetry and exofacial PS on protrusions. The 
exofacial exposure of PS promotes protrusion association with neighbouring 
cells (right). 3. Loss of membrane asymmetry and PS exposure extends along 
length of protrusions. 4. Calcium entry activates membrane repair pathways 
that promote scission of the protrusion. Bacteria are released from the cell in 


Infected Cell 


PS" vesicles. 5. Macrophages mediate uptake of PS* vesicles containing 
bacteria via the PS-binding receptor TIM-4. PS* vesicles may be engulfed by 
neighbouring cells either near the infected cell surface (left side) or within 
enclosed spaces that form as a result of protrusion penetration into the 
neighbouring cell (right side). TIM-4 may also promote L. monocytogenes 
infection indirectly, through its ability to suppress basal levels of 
pro-inflammatory cytokines as part of its homeostatic function in the 
immune system. 
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NRROS negatively regulates reactive oxygen species 
during host defence and autoimmunity 
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Reactive oxygen species (ROS) produced by phagocytes are essen- 
tial for host defence against bacterial and fungal infections. Indi- 
viduals with defective ROS production machinery develop chronic 
granulomatous disease’”. Conversely, excessive ROS can cause col- 
lateral tissue damage during inflammatory processes and therefore 
needs to be tightly regulated. Here we describe a protein, we termed 
negative regulator of ROS (NRROS), which limits ROS generation 
by phagocytes during inflammatory responses. NRROS expression 
in phagocytes can be repressed by inflammatory signals. NRROS- 
deficient phagocytes produce increased ROS upon inflammatory 
challenges, and mice lacking NRROS in their phagocytes show 
enhanced bactericidal activity against Escherichia coli and Listeria 
monocytogenes. Conversely, these mice develop severe experimental 
autoimmune encephalomyelitis owing to oxidative tissue damage in the 
central nervous system. Mechanistically, NRROS is localized to the 
endoplasmic reticulum, where it directly interacts with nascent NOX2 
(also known as gp91?"™ and encoded by Cybb) monomer, one of 
the membrane-bound subunits of the NADPH oxidase complex, 
and facilitates the degradation of NOX2 through the endoplasmic- 
reticulum-associated degradation pathway. Thus, NRROS provides 
a hitherto undefined mechanism for regulating ROS prodution—one 
that enables phagocytes to produce higher amounts of ROS, if 
required to control invading pathogens, while minimizing unwanted 
collateral tissue damage. 

In response to microorganisms and inflammatory stimuli, profes- 
sional phagocytes can generate ROS either within mitochondria or 
through a process named oxidative burst mediated by the NADPH 
oxidase 2 (NOX2) complex’*. Although many regulatory factors for 
ROS production in phagocytes have been described, our knowledge about 
its precise control is still limited'’**. Although priming bone marrow- 
derived macrophages (BMDMs) with either interferon-y (IFN-y) or 
lipopolysaccharide (LPS) was sufficient to induce ROS upon subse- 
quent zymosan stimulation, treatment with a combination of IFN-y 
and LPS had a synergistic effect (Fig. 1a)’°. Although IFN-y or LPS 
could enhance the expression of NOX2 (ref. 9), we speculated that there 
might be other mechanisms controlling this synergistic ROS produc- 
tion in phagocytes. Gene expression analysis by microarray under these 
conditions identified a previously uncharacterized gene, EMSMUSG 
00000052384, which we named Nrros (negative regulator of ROS, pre- 
viously known as Lrrc33) that was markedly downregulated upon prim- 
ing with a combination of IFN-y and LPS (Extended Data Fig. 1a). The 
predicted structure of NRROS contains a signal sequence, 21 putative 
leucine-rich repeat domains, a transmembrane domain, and a short 
cytoplasmic domain. NRROS was preferentially expressed in immune 
organs such as lymph nodes, thymus and spleen in mice (Extended Data 
Fig. 1b, c). Among leukocytes, myeloid cell subsets including macro- 
phages, neutrophils and dendritic cells had the highest NRROS expression 


(Fig. 1b and Extended Data Fig. 1d, e). Interestingly, priming with a 
combination of IFN-y and LPS or tumour necrosis factor (TNF)-« 
alone markedly repressed Nrros messenger RNA and protein express- 
ion in wild-type BMDMs (Fig. Ic, d). 

To reveal the biological functions of NRROS, we generated NRROS- 
specific antibody and NRROS-deficient mice (Extended Data Fig. 1f-j). 
At 6 weeks of age, all mice were viable and immune organs and leu- 
kocyte subsets were indistinguishable from those of wild-type mice 
(Extended Data Table 1 and data not shown). However, significantly 
augmented ROS production was observed from NRROS-deficient pri- 
mary BMDMs upon zymosan stimulation after priming for 24h with 
either IFN-y (Fig. le) or LPS (Fig. 1f). These observations were con- 
firmed in a variety of phagocytes, under several priming and activation 
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Figure 1 | NRROS is a negative regulator of ROS in phagocytes. 

a, Zymosan-induced ROS production in BMDMs primed as indicated. 

b, NRROS expression analysis by western blot in immune cells. Actin was used 
as loading control. KO, NRROS-deficient knockout; NS, nonspecific band; WT, 
wild type. c, d, mRNA (c) and protein (d) of NRROS in BMDMs treated as 
indicated. e-g, Zymosan-induced ROS production in WT and KO BMDMs 
primed with IFN-y (e), LPS (f) or both (g). a, e-g, Left, ROS kinetic plots of a 
representative experiment with three independent samples per group. Right, 
averaged area under curve (AUC) from five independent experiments. Error 
bars, s.e.m. *P < 0.05, **P < 0.01, ***P < 0.001. Unpaired Student’s t-test 
with Welch’s corrections, (a, e-g, left), paired Student’s t-test (a, e-g, right), 
unpaired Student’s t-test (c). RLU, relative light units. Data in b-d represent at 
least three independent experiments. 
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conditions using multiple ROS detection methods (Extended Data Fig. 2). 
Interestingly, priming with a combination of IFN-y and LPS, a con- 
dition that markedly repressed NRROS expression, attenuated the ROS- 
production difference between wild-type and NRROS-deficient cells 
(Fig. 1g), supporting the concept that inflammatory signals could con- 
trol ROS production by regulating NRROS expression. Finally, NRROS 
exhibited a similar expression pattern and regulation in human cells 
(Extended Data Fig. 3a—c), and short interfering RNA (siRNA)-mediated 
knockdown of NRROS in human monocyte-derived macrophages also 
resulted in increased ROS production (Extended Data Fig. 3d-h). Taken 
together, these results suggested that NRROS functioned as a general 
negative regulator of ROS production in phagocytes. 

Increased ROS production from NRROS-deficient BMDMs was also 
observed upon challenge with bacterial pathogens, including E. coli, 
Salmonella enterica serovar Typhimurium (hereafter S. typhimurium) 
and heat-killed L. monocytogenes (Fig. 2a—c and Extended Data Fig. 4a—c). 
More importantly, NRROS-deficient BMDMs and neutrophils were 
highly bactericidal in vitro (Fig. 2d and Extended Data Fig. 4d), which 
was not due to increased nitric oxide production or defective phagocy- 
tosis (Extended Data Fig. 4e, f). Conversely, enhanced bacterial killing 
by NRROS-deficient macrophages was eliminated upon pre-treatment 


i) 
s 
ir) 


— WT HKLM 


2.0, — WT E. coli 2.54— WT S. typhimurium 08 


a — KOE coli _ 25)— ko 8! typhimurium _ — KO HKLM 
S & 2.0 So 06 
x & 1.5) & lt 
5 > 1.0 aunt |: 5 04 i 
| =| m aa] 0.2 
iia x 0.5 ac 
0.0 0.0-——_—__—_—_. 0.0 +.—_.—__.—_. 
0 30 60 90 120 0 30 60 90 120 0 30 60 90 120 
Time after stimulation (min) Time after stimulation (min) Time after stimulation (min) 
d e 
S a 3 i WT+DPI __ 1005 
xX xX @ KO+DPI = 80 = 
z 7? zs 60 
to) toy 2 
3 31 zg 40 wr 
3 J 5 - 
G care 2 a *+KO 
30 60 120 30 60 120 0 5 10 15 
Time after stimulation (min) Time after stimulation (min) Time after infection (days) 
g h Liver Spleen i 
8 ae ge 2 oe 6% = 
@ 2 10 S4 & 
2 10 x 3 x a 
zs * = 6 - 3 2 
zc 5 are b D9 Fa 
= = s 34 3B 
ie} ie} 2 0 o 0 
EEGEERQ WT KO WT KO ee ee 
Time after infection (days) 
Listeria — + = + 
PMA = + 
j Liver Spleen k Liver Spleen 
S 1.0 ge 15 Ss 8) S 25 
x a x & 20 
zs = 1.0 = = 
= = “a0 > 45 
& 05 & ai tel ne 
3 : 305) , 3 05 3 . 
6 oli 6 a % 0.0 6 0. 
CWO WO SW WO RRS RRS 
o” or 
= > ROS - + - Ea 


scavengers 


Figure 2 | Increased ROS and bactericidal ability in NRROS-deficient 
phagocytes. a—c, ROS production in WT and NRROS-deficient (KO) BMDMs 
stimulated in vitro with E. coli (a), S. typhimurium (b) or heat-killed Listeria 
monocytogenes (HKLM; c). d, e, In vitro E. coli killing by IFN-y-primed WT 
and KO BMDMs without (d) or with (e) diphenyleneiodonium (DPI) pre- 
treatment. f, Survival curve of L. monocytogenes-infected WT (n = 5) and KO 
(n = 6) mice. g, Ex vivo ROS production by L. monocytogenes-infected WT 
and KO splenocytes (n = 3). h, L. monocytogenes burden in liver and spleen of 
WT (n= 5) and KO (n = 6) mice. i, Survival curve of L. monocytogenes- 
infected WT (n = 5), KO (n = 6), NOX2-deficient (NOX2 KO) (n = 5) and 
NRROS NOX2 double-deficient (DKO) (n = 3) mice. j, L. monocytogenes 
burden in liver and spleen of WT, KO, NOX2 and DKO mice (n = 3). 

k, L. monocytogenes burden in liver and spleen of WT and KO mice (n = 5) 
pre-treated with ROS scavenger cocktail. Error bars, s.e.m. *P < 0.05, 

**P < 0.01, ***P < 0.001. Unpaired Student’s t-test with (a-c) and without 
(d, e, g, h, j, k) Welch’s corrections, log-rank (Mantel-Cox) test (f, i). c.fu., 
colony-forming units. Data are representative of three (a-f, h) and two 

(g, i-k) independent experiments. 
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with diphenyleneiodonium, an inhibitor of ROS production (Fig. 2e). 
ROS is known to contribute to controlling L. monocytogenes infection, 
especially at higher bacterial burden’®”’. At a dose at which most wild- 
type mice succumbed, most NRROS-deficient mice survived the infec- 
tion (Fig. 2f). Further, NRROS expression in splenic myeloid cells during 
infection, even though detectable, was highly repressed (Extended Data 
Fig. 4g). In accordance with our in vitro data, ROS production was sig- 
nificantly higher in spleens of infected NRROS-deficient mice (Fig. 2g). 
Importantly, survival of NRROS-deficient mice correlated with signifi- 
cantly reduced L. monocytogenes burden in livers and spleens of these 
mice (Fig. 2h). Similar enhanced bacterial killing was also observed 
upon inoculation of NRROS-deficient mice with live E. coli (data not 
shown). The protective effects observed in NRROS-deficient mice dur- 
ing L. monocytogenes infection were completely abrogated when the 
mice were crossed to NOX2-deficient mice or treated with ROS sca- 
vengers (Fig. 2i-k)”’, suggesting that increased ROS production in NRROS- 
deficient phagocytes contributed to the enhanced host defence against 
invading pathogens. 

Despite the beneficial role in host defence, increased ROS produc- 
tion can also cause collateral tissue damage. As a negative regulator, 
NRROS might be required to control excessive ROS production during 
inflammation. Indeed, NRROS-deficient bone marrow chimaeric mice 
showed enhanced disease severity and high mortality in a myelin oli- 
godendrocyte glycoprotein (MOG)35_55-complete Freund’s adjuvant 
(CFA)-induced experimental autoimmune encephalomyelitis (EAE) model 
(Fig. 3a, b). A similar high severity of EAE was observed in phagocyte- 
specific NRROS-deficient mice (Nrros™" LysM-Cre?®*) (Extended Data 
Fig. 5a—c), excluding a possible contribution of T cells, especially Ty17 
cells, in this phenotype’*"*. Histological analysis revealed markedly 
increased lesion severity in Nrros"LysM-Cre?® mice compared to 
control mice (Extended Data Fig. 5d, e). Central nervous system (CNS)- 
infiltrating leukocytes—especially CD11b myeloid subsets—from NRROS- 
deficient bone marrow chimaeric mice showed significantly increased 
ROS production (Fig. 3c, d). Levels of malondialdehyde, one of the 
major toxic products of lipid peroxidation and indicator of oxidative 
damage”, were significantly higher in the CNS tissues from NRROS- 
deficient mice than in those from wild-type mice (Fig. 3e, f). Finally, 
treatment with ROS scavengers after EAE induction reduced disease 
severity and abrogated the difference between wild-type and NRROS- 
deficient bone marrow chimaeric mice (Fig. 3g) or Nrros!"LysM- 
CreP®* mice (Extended Data Fig. 5f, g), supporting the premise that 
the increased disease severity in NRROS-deficient mice was caused by 
augmented tissue oxidation and damage. In summary, these data dem- 
onstrated an indispensable role for phagocyte-specific NRROS in lim- 
iting ROS-induced tissue damage under inflammatory conditions. 

Next we investigated how NRROS regulates ROS production from 
myeloid cells. We did not detect any differences in mitochondrial ROS 
(mROS) generation despite an increased total ROS production in NRROS- 
deficient BMDMs treated with IFN-y and zymosan (Fig. 4a) or with 
mROS-specific stimulators, such as antimycin A and rotenone (Ex- 
tended Data Fig. 6a, b). Conversely, pharmacological inhibition of NOX2 
but not of mROS production largely abrogated the differential ROS gen- 
eration between wild-type and NRROS-deficient BMDMs (Extended 
Data Fig. 6c-g), implying that NRROS specifically regulates the func- 
tion of the NOX2 complex. The NOX2 complex is composed of two 
membrane-bound subunits, NOX2 and p22??™ (encoded by Cyba) 
and four cytosolic subunits, p40P?°*, p47P2°*, p67P?™* and RAC (RAC1 
or RAC2)!**!6, NOX2 and p22? form a heterodimer known as fla- 
vocytochrome bssg in the endoplasmic reticulum (ER), and this het- 
erodimerization is essential for the stability of these proteins’”’*. In 
NRROS-deficient BMDMs, the expression, phosphorylation and acti- 
vation of the cytoplasmic subunits were comparable to those of wild- 
type BMDMs (Extended Data Fig. 7a-c). However, protein, but not 
mRNA, levels of NOX2 and pare were increased in NRROS-deficient 
BMDMs (Fig. 4b and Extended Data Fig. 7d, e). Similarly, an increased 
expression of NOX2 and p22? was observed in various primary 
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Figure 3 | Mice with NRROS-deficient haematopoietic cells develop severe 
EAE owing to oxidative damage in the CNS. a, b, Clinical course (a) and 
survival curve (b) of WT and NRROS-deficient (KO) bone marrow chimaeric 
mice immunized with MOG35_55-CFA (n = 20). c, ROS production in CNS 
cells from EAE-induced mice (n = 4 WT, 3 KO). d, ROS production in 
CD11b* CNS cells from EAE-induced mice analysed by fluorescence-activated 
cell sorting (n = 5). MFI, mean florescence intensity. e, f, Malondialdehyde 
(MDA) adducts in the CNS tissues of EAE-induced mice analysed by western 
blot (e) and enzyme-linked immunosorbent assay (ELISA) (f) (n = 7). GAPDH 
as loading control. g, Clinical course (left) and survival curve (right) of EAE- 
induced mice (n = 9) treated with a ROS scavenger cocktail from day 10 
(arrow). Error bars, s.e.m. *P < 0.05, **P < 0.01, ***P < 0.001. Unpaired 
Student’s t-test without (a, d, f, g) and with (c) Welch’s corrections, log-rank 
(Mantel-Cox) test (b, g). Data are representative of two (c-g) and four (a, b) 
independent experiments. 


phagocytes from NRROS-deficient mice after inflammatory challenges 
in vivo (Extended Data Fig. 7f-h), supporting a role for NRROS in 
regulating protein expression or stability of the flavocytochrome bss¢ 
subunits in phagocytes. 

To better understand how NRROS regulates the expression of NOX2 
and p22? proteins, BMDMs were treated with cycloheximide to block 
de novo protein synthesis. NOX2 and p22??™ were markedly stabilized 
for extended durations in NRROS-deficient BMDMs compared to wild- 
type cells upon treatment (Fig. 4c), suggesting that NRROS partici- 
pated in the degradation of these proteins. If not dimerized, NOX2 and 
p22Ph° are presumed to be quickly degraded through a proteasome- 
dependent ER-associated degradation (ERAD) pathway’”’’. However, 
upon translocation out of the ER to phagosomal and plasma membranes, 
the NOX2-p22""™ complex might also get degraded by other mechan- 
isms, such as the lysosome-associated degradation pathway. Interest- 
ingly, the proteasome inhibitor MG132, but not the lysosome blocker 
chloroquine, normalized the differences in NOX2 or p22?"™ protein 
expression between wild-type and NRROS-deficient BMDMs (Fig. 4d, 
e and Extended Data Fig. 8a—e). Subcellular fractionation revealed that 
the accumulation of NOX2 and p22?" in NRROS-deficient cells started 
in the ER (Fig. 4f). In addition, most NOX2 detected in BMDMs was 
the 58-kDa ER-bound form (Fig. 4f)”°, which was sensitive to both 
Peptide-N-glycosidase F (PNGase F) and Endoglycosidase H (Endo 
H) enzymes (Fig. 4g)*’. Finally, when newly synthesized NOX2 was 
metabolically labelled with [°°S]methionine, higher NOX2 levels were 
already evident as early as the end of 1-h pulse period (time 0) and a 
slower decay of NOX2 persisted for up to 4 h during the chase period in 
NRROS-deficient BMDMs (Fig. 4h). This increased NOX2 was Endo 
H sensitive (Extended Data Fig. 8f), demonstrating its ER origin. In 
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Figure 4| NRROS regulates the activity of NOX2 oxidase complex in 
BMDMs. a, Zymosan-induced total ROS and mitochondrial ROS (mROS) in 
IFN-y-primed WT and NRROS-deficient (KO) BMDMs. Fold change relative 
to unstimulated control. b, Left, NOX2 and p22Phex in unprimed (—) and IFN-y- 
primed (+) BMDMs. Actin was used as loading control. Right, densitometry 
analyses relative to actin (n = 4). c-e, NOX2 and p22?* in IFN-y-primed 
BMDMs treated with cycloheximide (c), MG132 (d) or chloroquine (e). 

f, Subcellular fractionation and blotting of unprimed (-) and IFN-y-primed (+) 
BMDMs. PM, plasma membrane. g, NOX2 in PNGase- or Endo 

H-treated IFN-y-primed BMDM lysates. M-csfR was used as control for Endo 
H resistance. Actin was used as loading control. h, i, Left, autoradiography of 
immunoprecipitated (IP) NOX2 from [°°S]methionine-labelled IFN-y-primed 
BMDMs without (h) or with (i) MG132 treatment. Right, densitometry 
analyses relative to control in the flow-through with (h) or without 

(i) normalization to 0 h samples (n = 3). Error bars, s.e.m. *P < 0.05, 

**P < 0.01 (unpaired Student’s t-test). Data are representative of two (a, g) and 
at least three (b-f, h-i) independent experiments. 


addition, the difference in protein degradation was abrogated when 
cells were treated with MG132 (Fig. 4i), suggesting that NRROS facili- 
tated the degradation of nascent NOX2 through the ERAD pathway in 
phagocytes. 

Consistent with these results, we demonstrated that NRROS mostly 
resided in the ER by subcellular fractionation, flow cytometric analysis 
and immunofluorescent staining (Fig. 5a and Extended Data Fig. 8g, h). 
Recent studies have suggested that the heat shock protein (HSP)70- 
HSP90-CHIP axis is involved in the degradation of NOX2 through 
ERAD’”. However, expression of HSP70, HSP90 and CHIP were com- 
parable between wild-type and NRROS-deficient macrophages (Extended 
Data Fig. 9a). Even though pre-treatment with the HSP90 inhibitor 
radicicol markedly inhibited NOX2 expression in both wild-type and 
NRROS-deficient cells—almost to a level where the difference in ROS 
production between wild-type and NRROS-deficient BMDMs was no 
longer observed (Extended Data Fig. 9b, c)—the higher NOX2 levels in 
NRROS-deficient BMDMs were maintained (Extended Data Fig. 9d). 
These data suggested that NRROS most likely controls NOX2 degra- 
dation in the ER by a different mechanism, although we could not exclude 
a possible cooperation with the HSP90-HSP70-CHIP axis. 
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Figure 5 | NRROS promotes NOX2 degradation in the ER. a, Subcellular 
fractionation of RAW cells and western blot analysis. Numbers represent 
fractions. b, Anti-NOX2-immunoprecipitated NRROS and p22?”™* from 
IFN-y-primed BMDMs. iso, isotype control. c, NRROS-Flag, NOX2 and 
p22?" immunoprecipitated with Flag or NOX2 antibody from CHO cells 
transfected with the indicated plasmids. Bottom panels, input samples. Actin 
was used as loading control. Numbers below indicate lanes. d, Autoradiography 
of NOX2 from [*°S]methionine-labelled, MG132-treated 293T cells transfected 
with the indicated plasmids. Right graph, densitometry relative to time 0 

of cells transfected with NOX2 alone. e, NOX2 and p22Phex in CHO cells 
transfected with the indicated ratio of NOX2/p22?™ (1:1)-expressing plasmids 
to NRROS-Flag-expressing plasmid. Actin was used as loading control. 
Numbers below indicate lanes. f, Model depicting NOX2 complex regulation 
by NRROS. **P < 0.01, ***P < 0.001 (unpaired Student’s t-test). Data are 
representative of at least three independent experiments. 


Next, given their colocalization in the ER, we examined whether 
NRROS interacts with the NOX2-p22?"™ complex. Interestingly, co- 
immunoprecipitation of NRROS by NOX2 was detected in wild-type 
BMDMs (Fig. 5b). As neither the NOX2 nor the p22Phex monomer is 
stable in primary phagocytes, we further studied their interaction with 
NRROS in CHO and 293T cells”. Surprisingly, upon ectopic over- 
expression of Flag-tagged NRROS, together with NOX2 and/or p22?"™ 
in CHO cells, we observed a specific interaction between NRROS and 
NOX2 only, and not between NRROS and the p22??™ monomer or the 
NOX2-p22?"™ heterodimer (Fig. 5c, lanes 5-7, top panel). On the other 
hand, NOX2 could form a heterodimer with either NRROS or p22Pbex 
(Fig. 5c, lanes 5-7, middle panel). Importantly, p22Phex interfered with 
the interaction between NRROS and NOX2 (Fig. 5c, lanes 5 and 7, top 
and middle panels). Co-expression of NOX2 with NRROS led to a sub- 
stantial downregulation of NOX2 protein (Fig. 5c, lanes 3 and 5, bot- 
tom panel), but not the transcript (data not shown). The expression of 
p22P°°* protein remained unchanged or was only mildly reduced upon 
NRROS co-expression (Fig. 5c, lanes 2 and 4, bottom panel). Consistent 
with the co-immunoprecipitation data, the formation of a NOX2-p22?>™ 
heterodimer prevented the NRROS-mediated downregulation of NOX2 
(Fig. 5c, lanes 5 and 7, bottom panels). Similar results were also obtained 
in 293T cells (data not shown). Pulse chase experiments in 293T cells 
further suggested that NRROS facilitated the degradation of NOX2 
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through the ERAD pathway, which could be prevented either by MG132 
treatment or co-expression of p22?"™ (Fig. 5d). Finally, a relatively stable 
NOX2-p22?** complex could be shifted to the NRROS-mediated deg- 
radation pathway with increased molar ratio of NRROS to NOX2- 
paar, where in addition to the decline in NOX2 levels, a reduction 
in p22Phex levels was observed as well (Fig. 5e, lanes 5 to 8). 

Altogether, our data support a model in which NRROS directly inter- 
acts with nascent NOX2 monomers and facilitates their degradation 
through the ERAD pathway (Fig. 5f)**. p22?" competes with NRROS 
to form a much more stable NOX2-p22?"™ complex. Certain inflam- 
matory signals can regulate the expression of NRROS, and thereby mod- 
ulate the expression of the NOX2 complex and ROS production (Fig. 5f). 
The delicate regulation of NOX2 by NRROS ensures that there is suf- 
ficient ROS production by phagocytes during host defence while at the 
same time limiting collateral tissue damage. Further understanding of 
this pathway may provide novel therapeutic approaches to target ROS 
production in various diseases. 


METHODS SUMMARY 


All animal experiments were approved by the Genentech Institutional Animal Care 
and Use Committee (IACUC). Mice were infected with L. monocytogenes and mon- 
itored for mortality or euthanized for bacterial burden analyses. EAE was induced 
by immunizing mice with MOG35.55-CFA/pertussis toxin and monitored for clin- 
ical score and mortality. Oxidative damage in the CNS of EAE-induced mice was 
assessed by malondialdehyde measurement with ELISA or western blot (OxiSelect 
MDA Kit, Cell Biolabs). In vivo ROS neutralization was achieved by injecting mice 
with a cocktail of scavengers (15 mg kg ' EUK134 (Cayman Chemical), 20 mg kg * 
FeTPPS (Calbiochem) and 50 mgkg ‘ PBN (Sigma-Aldrich)). In vitro E. coli kill- 
ing was assessed by gentamicin protection assay. In vitro, ROS was measured by 
incubating primed phagocytes with an inducing agent and either luminol for con- 
tinuous detection by luminometer or CM-H2DCFDA (total ROS) or MitoSOX (mROS) 
(Invitrogen) for FACS analyses. Ex vivo ROS was measured by incubating cells with 
CM-H,DCFDA andanalysed by FACS. Whole cell/tissue lysates prepared in RIPA 
or Triton X-100 buffer were used for immunoprecipitation and western blot stud- 
ies. Metabolic labelling of cells was performed using [*°S]methionine. Subcellular 
fractionation was carried out by sucrose gradient ultracentrifugation. Total RNA 
was used in microarray or Taqman expression analyses. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 

Mice. NRROS-deficient mice were generated in collaboration with Lexicon 
Pharmaceuticals” and backcrossed to C57BL/6J more than 14 generations. NOX2- 
deficient mice were procured from Jackson Laboratories and bred with NRROS- 
deficient mice to generate NRROS NOX2 double-deficient mice. NRROS conditional 
knockout (Nrros™") mice were generated in-house by targeting exon 4 of Nrros 
that encodes most of the protein. These floxed mice were bred with LysM-Cre mice 
(kind gift from University of California, San Diego) to specifically delete NRROS 
expression in M lysozyme-expressing cells, namely macrophages, monocytes and 
neutrophils. Mice were routinely genotyped by PCR using the following primers: 
5'-TACTCACTGGCCTTGCCCTC-3’, 5’-AGAGTTCCATCCTCAGTCCC-3’ and 
5'-GTTCACATCTGGTGCCAGAC-3’. C57BL/6] and B6.SJL-Ptpre“Pepc’/Boy] 
mice were procured from Jackson Laboratories. Rag2-deficient mice (B6.129S6- 
Rag2'”""*“4 N112) were obtained from Taconic. 

For the generation of bone marrow chimaeras, B6.SJL-Ptpre*Pepc’/Boy] recip- 
ient mice were irradiated with a total of 1,050 rads (in two doses of 525 rads sep- 
arated by 4-h interval) using a °’Cs source. 10-15 X 10° total bone marrow cells 
isolated from femur and tibias of the wild-type and NRROS-deficient donor mice 
were injected into the tail vein. Reconstituted mice were given water containing 
0.11 mg ml polymyxin B and 1.1 mg ml ' neomycin for 2 weeks and then switched 
to regular water. Mice were allowed to reconstitute for at least 12 weeks (more than 
90% reconstitution as assessed by CD45.1 (A20, catalogue no. 553776) and CD45.2 
antibody (104, catalogue no. 558702) (BD Biosciences) staining and FACS (LSR II, 
BD Biosciences)) and used for in vivo experiments. All animal experiments were 
approved by the Genentech Institutional Animal Care and Use Committee. 
Isolation of cells. For BMDMs, femur and tibia were collected from 3—5 mice of 
each genotype and bone marrow cells were flushed with complete DMEM con- 
taining 50 U ml 1 penicillin, 50 mg ml~ : streptomycin, 2 mM L-glutamine, sodium 
pyruvate and 10% FBS. Erythrocytes were removed by ACK red blood cell lysis buffer 
treatment and the cell suspension was filtered through a 70-j1m cell strainer to 
remove any cell clumps. The single-cell suspensions were then cultured at 37 °C for 
1h, and non-adherent cells were collected and re-plated in complete DMEM with 
20% L929 conditioned media. To fully differentiate BMDMs, the cells were cultured 
for an additional 6 days with media change every 3 days. All the cells, when analysed 
by FACS, were CD11b*F4/80* (BD Biosciences). In some experiments, recom- 
binant mouse M-CSF (R&D Systems) was used at a concentration of 50 ng ml? 
instead of L929 medium. BMDMs were primed with 10 ng ml of IFN-y (R&D 
Systems) for 18—24h, unless otherwise stated. 

For peritoneal macrophages, mice were injected with 1 ml of 4% thioglycollate, 
and 4 days later peritoneal cells were collected by lavage using 8 ml PBS. Cells were 
allowed to adhere for at least 2 h and adherent macrophages were collected, counted 
and plated in complete DMEM for further analysis. 

For neutrophil isolation, single-cell suspensions of the total bone marrow cells 
from 3-4 mice were prepared and subjected to Percoll gradient separation (78- 
69%-52%) by centrifugation at 1,500g for 40 min. Neutrophils were collected from 
78-69% interphase, washed twice, re-suspended in HBSS and used for further anal- 
ysis. Cell purity was more than 90% as analysed by FACS staining with CD11b 
(M1/70, catalogue no. 48-0112; eBioscience) and Gr-1 antibodies (RB6-8C5, cata- 
logue no. 553126; BD biosciences). 

For studies with human cells, blood received at Genentech health centre from 
healthy donors was used to isolate peripheral blood mononuclear cells (PBMCs) by 
Ficoll gradient (GE Healthcare). Monocytes were further isolated from the PBMCs 
using human monocyte isolation kit II (Miltenyi Biotec) according to the manufac- 
turer’s instructions. Monocyte-derived macrophages were generated by culturing 
the purified monocytes for 7 days in complete RPMI 1640 containing 10% FBS and 
100ngml' human M-CSF (R&D Systems). Fresh medium containing 100 ng ml“ 
of M-CSF was added to cells on day 4 and cells were used for further analysis on 
day 7. PBMCs were also used to isolate B cells and T cells using CD19 microbeads 
and Pan T Cell Isolation Kit II (Miltenyi Biotec), respectively, according to the manu- 
facturer’s instructions. 

Isolation of CNS-infiltrating mononuclear cells. Animals were perfused with 
30 ml of PBS, and brain and spinal cords were collected. A single-cell suspension 
was obtained by grinding the tissues using 1 ml syringe plunger and passed through 
a 70-|tm strainer. Mononuclear cells were obtained by Percoll gradient (37/70%, 
GE Healthcare) centrifugation at 390g for 30 min at room temperature (20-25 °C) 
collected from the interphase, washed and used for further analysis. 

Plasmids. Sequence coding murine NRROS protein was amplified by PCR using 
mouse splenic complementary DNA as template, and inserted into BglII(blunt)/ 
EcoRI site of Nflag-pRK vector which has signal peptide sequence and Flag-tag 
sequence. Then, cDNA encoding signal peptide-Flag-tag~-NRROS was transferred 
onto retroviral vector pMSCV-IRES-GFP to generate Flag- NRROS-pMSCV. This 
amino-terminally Flag-tagged NRROS (Flag-NRROS) was used to generate RAW 
stable cell lines. A plasmid expressing carboxy-terminally Flag-tagged NRROS 


(NRROS-Flag) was generated by cloning PCR-amplified coding region of NRROS 
including its signal sequence into pRK5 vector using Ascl and Xbal sites to generate 
NRROS-Flag-pRK. Mouse Cyba and Cybb genes encoding p22?" and NOX2, 
respectively, were amplified by PCR using mouse splenic cDNA and cloned into 
pRKS vector using EcoRI and Xbal sites to generate p22?"°*-pRK and NOX2-pRK, 
respectively. 
Generation of Flag-NRROS-RAW cells by retroviral transduction. Phoenix E 
cells were transfected with either Flag- NRROS-pMSCV or pMSCV-IRES-GEP con- 
trol plasmid by the calcium phosphate method. After 48 h, the retroviral supernatants 
were collected, filtered through 0.45-j1m filters, supplemented with polybrene 
(10 tg ml’) and added to RAW cells that were plated in a six-well plate the previ- 
ous night. Plates were centrifuged at 1,200g at 32 °C for 120 min. The viral super- 
natant was replaced by complete DMEM after an additional 2 h. Transduced RAW 
cells were sorted (FACSAria, BD Biosciences) based on GFP expression. 
Transfection of cells. CHO cells were transfected with the indicated plasmid 
using Lipofectamine 2000 (Life Technologies) according to manufacturer’s instruc- 
tions. In brief, 10 1g total DNA was diluted in 500 pil of opti- MEM and 25 pl of 
Lipofectamine 2000 was diluted in another 500 ul of opti- MEM. The two were 
combined, incubated for 20 min and added drop-wise to cells grown on 100-mm 
plates at 70-80% confluency. After 24h, cells were collected by trypsinization and 
lysed in RIPA buffer (50 mM Tris, pH 7.4, 150 mM NaCl, 2mM EDTA, 1% NP-40, 
1% SDS) and subjected to western blot analysis or were used for immunoprecip- 
itation with the indicated antibody. For the titration experiments, 2.5 1g each of 
NOX2-pRK and p22?"°*-pRK plasmids were mixed and then serially diluted two- 
fold while maintaining their 1:1 ratio. The amount of NRROS-Flag-pRK plasmid 
was held constant at 2.5 ug and added to the above tubes so that the ratio of NOX2- 
pRK:p22?"**-pRK:NRROS-Flag-pRK was 1:1:1, 1:1:2, 1:1:4 and 1:1:8. Total amount 
of DNA was adjusted to 10 Lg with empty pRK plasmid. After 24h cells were col- 
lected, lysed and subjected to western blot analyses. 293T cells grown in 100-mm 
plates were transfected using Fugene 6 reagent (Promega) at 3:1 ratio (DNA:Fugene), 
according to the manufacturer’s instructions and processed as described for CHO 
cells. Both CHO and 293T cells were obtained from an internal cell-banking pro- 
gram (gCell) at Genentech. 
siRNA knockdown. Chemically modified siRNAs” were synthesized in house. 
siRNA specific for Nrros was composed of 5’-mAmGAGCUAGAUUUGCAGAd 
GdAdAmAmCmU-3’ (sense) and 5’-dTUUCUCUGCAAAUCUAGCUmCmU- 
3’ (antisense), and the control scrambled siRNA was 5'-mGmGAGCGCACCAU. 
CUUCdTdCdAmAmUmU-3’ (sense) and 5’-dTUGAGAAGAUGGUGCGCUm 
CmC-3’ (antisense). RAW cells were transfected with ~600 pmol siRNA using Cell 
Line Nucleofection Kit V (Lonza). 24 h later, cells were re-plated at 5 X 10* cells per 
well ofa 96-well clear-bottom white plate (Corning), primed with 10 ng ml” IFN-y 
(R&D Systems) for another 24h and used for further analysis. 
ROS measurement. For luminol-based ROS measurement, 1 X 10° macrophages 
were plated in clear-bottomed 96-well white plates (Corning) in triplicate, and primed 
for 24h with 10 ng ml’ of IFN-y, 100 ng ml * LPS (Invivogen) or 100 ng ml”? TNF-« 
(R&D Systems), either alone or in combination. In most of the experiments, cells 
were primed with 10ng ml! IFN-y unless otherwise indicated. Oxidative burst 
was induced in luminol medium (PBS containing 5mM glucose, 1 mM MgCl, 
0.5 mM CaCl, and 100 tM luminol (Sigma-Aldrich)) by stimulating macrophages 
with 100 pg ml” * zymosan A (Invivogen), 100 ng ml” ' PMA (Sigma-Aldrich), heat- 
killed Listeria monocytogenes (Invivogen) at multiplicity of infection (m.o.i.) 50, 
E. coli strain 26 (ATCC) at m.o.i. 50, or S. typhimurium SL1344 (a kind gift from 
D. Monack, Stanford University) at m.o.i. 10. Luminescence was measured by 
Glomax luminometer (Promega). Neutrophils and CNS mononuclear cells were 
re-suspended in the luminol medium (1 X 10° cells per well) and stimulated with 
100 ngml_' PMA or 100 pg ml! zymosan. For experiments with inhibitors, the 
following chemicals were added to the cell cultures during the last 1h of the 
priming: 5 .M diphenyleneiodonium (Calbiochem), 20 1M antimycin A (Sigma- 
Aldrich) or 3mM apocynin (Calbiochem) unless otherwise indicated. For treat- 
ment with apocynin, the compound continued to be present during the stimulation, 
whereas for other treatments, cells were washed with PBS before ROS induction. 
For FACS-based measurements, macrophages plated in 6-well plates were treated 
with the indicated stimulant for 30 min at 37 °C in the presence or absence of 5 uM 
CM-H3DCFDA (Invitrogen) to measure total intracellular ROS or 5 pM MitoSOX 
(Invitrogen) to measure mitochondrial ROS. Cells were washed with PBS, collected 
by trypsinization and analysed by FACS. For ROS measurement induced by rote- 
none (Calbiochem) or antimycin A, cells were treated with or without 10 1M of the 
compounds for 2 h, washed with PBS, collected by trypsinization and analysed by 
FACS. Splenocytes and CNS mononuclear cells were stimulated with 100 ng ml! 
PMA in the presence or absence of 5 4M CM-H,DCEDA for 30 min, washed with 
PBS, collected and stained for surface markers CD11b (eBioscience), Gr-1, B220 
and CD3 (BD Biosciences) for 30 min on ice before analysing by FACS. 
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Measurement of nitric oxide. BMDMs were treated with 10ngml’ IFN-y, 
10ngml- 'TNE-o,, 100 ngml- LPS or their combination for 24h, and nitric oxide 
in the culture supernatant was measured using Griess assay (Promega) with Spectra 
Max 340 (Molecular Devices) spectrophotometer according to the manufacturer’s 
recommendations. 

RNA isolation and real-time RT-PCR. RNA was isolated using RNeasy kit 
(Qiagen). Total RNA of 20 different human tissues was obtained from Clontech. 
Mouse major tissues total RNA was procured from Zyagen. RNA samples were 
analysed by real-time RT-PCR with TaqMan One-Step RT-PCR Master Mix reagents 
(Applied Biosystems) and the appropriate primers and probes. Results were nor- 
malized to those of the housekeeping gene Rp/19 (encoding ribosomal protein L19) 
and are reported as 24‘T The primer and probe sets used were: Rp/19: forward 5'- 
GCATCCTCATGGAGCACAT-3’, reverse- 5'- CTGGTCAGCCAGGAGCTT-3’, 
probe 5'-CITTGCGGGCCTTGTCTGCCTT-3’; Nrros: forward 5'-ACTGCAGC 
TTCCCAAGGA-3’, reverse 5'-TGGGTACCGAAGCAAGGT-3’, probe 5'-AGT 
CAGCGACTCCGTCGACCAC-3’; Cybb (NOX2): forward 5'- ACTGCGGAGA 
GTTTGGAAGA-3’, reverse 5’-GGTGATGACCACCTTTTGCT-3’, probe 5’-GA 
GGTTGGTTCGGTTTTGGCGA-3’; and Cyba (p22P™): forward 5'-AAAGAGG 
AAAAAGGGGTCCA-3’, reverse 5'-ATGGCTGCCAGCAGATAGAT-3’, probe 
5'-ACTACGTCCGGGCTGCCCTC-3’. 

Microarray analysis. Statistical analyses of microarray data were performed using 
the R programming language (http://r-project.org). Microarray data were normal- 
ized using the RMA method”’. Data were prefiltered to remove probes that were 
not mapped to an annotated Entrez gene. We also filtered our data to retain only a 
single probe per gene, selecting the probe with the highest variance, if multiple 
probes were found for the gene”*. For differential expression analysis, the limma R 
package was used”. We modelled the synergistic regulation of gene expression by 
the combined IFN-y and LPS treatment as an interaction term in our linear model. 
This model will identify changes that are significantly different from the sum of 
the individual treatments. Multiple test correction was done using the method of 
Benjamini and Hochberg”. Genes were considered significantly different if they 
changed more than 1.4-fold at a false discovery rate of 0.05. Genes were further 
filtered for immune-cell-specific expression using the gene sets defined by the 
Immune Response In Silico (IRIS) project*'. As the IRIS-defined gene sets were 
derived from human immune cells, we mapped the human genes to mouse ortho- 
logues using the HomoloGene database*’. Genes from all IRIS-defined categories 
were included in the analysis. Data were submitted to the NCBI (accession number 
GSE53986) 

Subcellular fractionation. A discontinuous sucrose gradient was used for frac- 
tionation of cells as described*. In brief, cells were collected either by scraping (RAW 
cells) or by trypsinization (BMDMs), washed with PBS, re-suspended in isotonic 
solution (10mM Tris, pH 7.5, 0.5mM MgCl), incubated on ice for 15 min and 
homogenized using a Dounce homogenizer. Cell disruption was confirmed by try- 
pan blue staining. Homogenate was made isotonic with 1.46 M sucrose (250 mM 
final concentration) and centrifuged at 10,000g for 5 min to remove debris and 
unbroken cells. The homogenate was then layered onto a sucrose gradient of 0.58 M, 
0.88 M and 1.1 M, and ultracentrifuged at 100,000g for 2 h using TLS5 (Beckman 
Coulter optima). Five fractions numbered 1-5 were collected and analysed by 
western blot. ER and plasma membranes were enriched in the pellet and fraction 3, 
respectively. Fractions 1 and 2 were enriched in Golgi apparatus and lysosome. 
Tissue lysates, immunoprecipitation and western blot analysis. For lysing tis- 
sues to analyse NRROS expression, C57BL/6] mice were perfused with 30 ml PBS, 
organs were collected and homogenized in RIPA buffer containing protease and 
phosphatase inhibitor cocktail (Halt protease and phosphatase inhibitor cocktail 
from Pierce). To analyse NRROS expression in different immune cell types, FACS- 
sorted cells were lysed in RIPA buffer containing protease and phosphatase inhib- 
itor cocktail. For inhibitor treatments in BMDMs, cells were cultured in the pres- 
ence of cycloheximide (10 pg ml}, Sigma), MG132 (10 pg ml !, Calbiochem) or 
chloroquine (10 jig ml 1 Calbiochem) for the indicated time, collected and lysed 
in RIPA buffer. Neutrophils were lysed in solution containing 10% glycerol, 3% 
SDS, 1mM PMSF and 5 mM Naf, and heated immediately at 95°C for 5 min. 
Equal amounts of proteins, measured using BCA assay (Pierce), were separated by 
SDS-PAGE on 4-12% Bis-Tris gels (Invitrogen) and transferred onto nitrocellu- 
lose membranes using iBlot apparatus (Invitrogen). Membranes were blocked for 
1h at room temperature with 5% milk in TBST and probed with indicated anti- 
body in 5% milk overnight at 4 °C. 

The following primary antibodies were used: NOX2 (gp91P®°*) (54.1, catalogue 
no. sc-130543), p22Phex (FL-195, catalogue no. sc-20781), p47Phex (D10, catalogue 
no. sc-17845), p4oPhex (H-300, catalogue no. sc-30087), GFP (B-2, catalogue no. 
sc-9996) from Santa Cruz Biotechnology, p67°"™ (rabbit polyclonal, catalogue no. 
07-002) from Millipore, phospho-p40?"™* (T154, catalogue no. 4311), B-integrin 
(rabbit polyclonal, catalogue no. 4706), Na-K ATPase-« (rabbit polyclonal, cata- 
logue no. 3010), GAPDH-horseradish peroxidase (HRP) (14C10, catalogue no. 
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3683), HSP90 (C45G5, catalogue no. 4877), HSP70 (D69, catalogue no. 4876), CHIP 
(C3B6, catalogue no. 2080) from Cell Signaling Technology, phospho-p47?"™ 
(Ser370) (rabbit polyclonal, catalogue no. A1171) from Assay Biotechnology, cal- 
nexin (rabbit polyclonal, catalogue no. SPA-860) from Stressgen Biotechnonlogies, 
gos28 (rabbit polyclonal, catalogue no. GTX61678) from GeneTex, actin (rabbit 
polyclonal, catalogue no. A5060) from Sigma-Aldrich. Anti-NRROS monoclonal 
antibody was generated by immunizing Armenian hamsters with E. coli-derived 
recombinant NRROS (extracellular domain). Secondary antibodies include HRP- 
conjugated anti-rabbit (catalogue no. 7074), anti-mouse (catalogue no. 7076) (Cell 
Signaling Technology) and anti- Armenian hamster (Jackson Immunoresearch, cat- 
alogue no. 127-035-160). Image quantification was performed using ImageJ soft- 
ware (NIH). 

For glycosidase treatment, lysates were incubated with PNGase-F (New England 

Biolabs) for 1 h or with Endo-H (New England Biolabs) for about 18 h according to 
the manufacturer’s instructions. For NOX2 immunoprecipitation, lysates were incu- 
bated with 101g of the antibody overnight at 4°C, immunoprecipitates were 
captured with protein A/G ultralink resin (Thermo Scientific) and eluted with sam- 
ple buffer. For anti-Flag immunoprecipitation, lysates were incubated with EZview 
Red ANTI-FLAG M2 affinity gel (Sigma) at 4°C for 4h and eluted with 3 x Flag 
peptide (Sigma) according to manufacturer’s instructions. 
[?°S]Methionine metabolic labelling. Metabolic labelling of BMDMs was per- 
formed as decribed**”’. In brief, BMDMs were primed with IFN-y (10 ng ml © 1) for 
24h. Cells were washed and incubated with 25 ,.Ci ml! [°°S]methionine (Perkin 
Elmer) in DMEM high glucose medium without methionine (Life Technologies) 
supplemented with 50 U ml penicillin, 50 mg ml streptomycin, 2 mM L-glutamine, 
sodium pyruvate and 10% FBS. After 1 h of labelling, cells were either collected (for 
time 0) or incubated further for the indicated time in complete DMEM with non- 
radiolabelled methionine. 

Cells were washed three times with cold PBS, and then lysed in lysis buffer 
(50 mM Tris, pH 7.5, 150 mM NaCl, 1 mM EDTA, 1% Triton X-100) containing 
protease and phosphatase inhibitor cocktail (Halt protease and phosphatase inhib- 
itor cocktail from Pierce). Lysates were pre-cleared with control agarose beads (Pierce) 
for 1h and incubated with anti-NOX2 antibody overnight. Immunoprecipitates 
were captured using Protein A/G ultralink resin, eluted with sample buffer and sep- 
arated by SDS-PAGE. Gels were dried using a gel drier and exposed to X-ray film. 

For experiments with Endo-H digestion, immunoprecipitates were eluted in Endo 
H buffer (100 mM Na-acetate, pH 5.6, 0.1% SDS, 1 mM PMSF)**. Equal amounts of 
the eluted samples were incubated with or without Endo-H (New England Biolabs) 
for about 18 h and samples were used in SDS-PAGE as described above. Densitom- 
etry analysis was performed using ImageJ software and data were normalized to a 
nonspecific band in the input samples. 

For experiments using MG132, cells were pre-incubated with 20 1M MG132 
(Calbiochem) for 1h before pulse. The MG132 treatment was continued during 
pulse and until end of the chase period. 

Metabolic labelling and NOX2 immunoprecipitation in 293T cells was carried 
as described for BMDMs except that cells were pulsed with 100 wCiml ' [°°S] 
methionine for 15 min instead of 1 h. Overexpressing NOX2 by transient transfec- 
tion enabled us to pulse cells for a shorter duration (for example, 15 min) and still 
be able to immunoprecipitate and detect NOX2. 

RAC1 activation assay. The activity of RAC] in wild-type and NRROS-deficient 
BMDMs was measured using RAC] activation assay kit (Cell Biolabs) according to 
the manufacturer’s instructions. In brief, wild-type and NRROS-deficient BMDMs 
were primed with 10 ng ml" IFN-y for 24h. Cells were stimulated with 100 pg ml" 
zymosan for 5, 15 or 30 min. Cell lysates were prepared and used immediately for 
RACI pulldown with the p21-activated protein kinase (PAK)-p21 binding domain 
(PBD) beads. The activated RAC1 was detected by anti-RAC1 antibody supplied 
with the kit. 

Immunofluorescence staining. RAW 264.7 cells stably expressing Flag-NRROS 
were grown overnight in chamber slides. Cells were then fixed with 4% parafor- 
maldehyde, washed with PBS, and treated with 50 mM NH,CI/PBS. After rinsing 
with PBS, cells were permeabilized and blocked in saponin buffer (0.4% Saponin, 
1% BSA, 2% normal goat serum in PBS). Mouse anti-Flag (Sigma-Aldrich), rabbit 
anti-calnexin (Enzo Life Sciences) and rat anti- LAMP 1 (Santa Cruz Biotechnology) 
antibodies were used. Secondary goat anti-mouse Alexa 647 was purchased from 
Invitrogen and goat anti-rabbit-Cy3 and donkey anti-rat-Cy5 antibodies were from 
Jackson Immunoresearch. 

Confocal microscopy. Images were captured by a Leica SPE laser scanning con- 
focal microscope (Leica) equipped with Leica DM5500Q under 63 1.30 NA CS 
ACS APO oil-immersion objective controlled by LAS AF image acquisition and 
processing software. Excitation and emission for DAPI, Alexa 488, Cy3 and Cy5 
were used at default settings set by the manufacturer (Leica). 

Phagocytosis assay. dsRed E. coli (gift from K. Huynh, Genentech) were opso- 
nized with fresh mouse serum, and BMDMs primed with 10 ng ml! IFN-y for 
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24h were allowed to phagocytose the bacteria for 30 min. Cells were collected and 
analysed by FACS. 

In vitro bacterial killing assay. Macrophages plated at 0.25 X 10° cells per well of 
a 24-well plate were primed with 10 ng ml ' of IFN-y for 24h in antibiotics-free 
complete DMEM. Cells were infected with serum opsonized E. coli strain 26 at m.o.i. 
of 50 in 250 yl total volume by centrifugation at room temperature for 5 min at 
1,000 r.p.m. to synchronize the infection and further incubated at 37 °C for 30 min. 
Medium was replaced with 100 1g ml * of gentamicin (Invitrogen)-containing 
medium. After an additional 5 min (considered as 30-min time point), 30 min 
(60-min time point) and 90 min (120-min time point), cells were washed twice 
with 1 ml PBS and lysed in 250 pl of 0.1% Triton X-100 in PBS. Twofold serial 
dilutions of the lysates were prepared in PBS and 5 pl was dropped on a LB agar 
plate with no antibiotics. Colonies were counted after 18-24 h incubation at 37 °C. 
Total protein in the lysates was measured using BCA assay and the colony numbers 
were normalized to the amount of protein. 

Neutrophil bacterial killing assay was performed as described previously’. 

In brief, twofold dilutions (2.5-0.625 X 10°) of neutrophils were added to 25 X 
10° c.f.u. E. coli strain 26 and fresh mouse serum (5% final concentration) in a total 
volume of 1 ml, in triplicate. The tubes were rotated for 2 h and 100 jl of the sample 
was diluted with 1 ml sterile distilled water (pH adjusted to 11 by NaOH) to lyse 
cells. Serial dilutions were prepared and plated on LB agar plates without anti- 
biotics. Bacterial colonies were counted after 24h incubation at 37 °C. 
In vivo bacterial challenge. For L. monocytogenes infection studies, age- and sex- 
matched (6-8 weeks old) wild-type, NRROS-deficient, NOX2-deficient or NRROS NOX2 
double-deficient littermate mice were administered intravenously with L. mono- 
cytogenes (ATCC 43251) (0.1 X 10° c.f.u. for females and 0.5 X 10° c.f.u. for males), 
and were monitored for 14 days for morbidity and mortality. For bacterial burden 
experiments, mice were administered intravenously with 1 X 10°c.f.u. L. mono- 
cytogenes or 2 X 10%c.f.u. E. coli strain 26, and killed at 48h (L. monocytogenes) or 
6h (E. coli). Spleens and livers were collected in 50-ml tissue grinder tubes (VWR 
International) and homogenized in 0.1% Triton X-100 in PBS. Twofold serial dilu- 
tions of the homogenates were prepared and plated in triplicate on brain heart 
infusion agar. Bacterial colonies were counted after 24 h. Total protein in the lysate 
was measured using BCA assay and colony numbers were normalized to the amount 
of protein. 

For ROS neutralization, mice were injected intravenously with either a cocktail 
of scavengers** that included 15 mgkg” * EUK134 (Cayman chemicals) 20 mg kg * 
FeTPPS (5,10,15,20-tetrakis(4-sulphonatophenyl)porphyrinato iron (III) chlor- 
ide) (Calbiochem) and 50 mgkg ' PBN (N-tert-butyl-a-phenylnitrone) (Sigma- 
Aldrich) or 2.5% DMSO (Sigma-Aldrich) as vehicle control in a total volume of 
200 pl at 60 min before administering L. monocytogenes. The spleens and livers 
were collected after 24h and processed as described above, for enumeration of 
bacterial burden. 

For all L. monocytogenes infection experiments, a sample size of at least five mice 

per genotype was used in each independent experiment except for data in Fig. 2i, j 
where three mice were used for some groups. Mice of both genders were used. No 
animals were excluded from any study. No blinding or randomization method was 
used to group the animals. 
Induction of EAE. EAE was induced as described previously”. In brief, age- and 
sex-matched littermate mice were immunized subcutaneously at the base of the tail 
with 300 tpg MOG35_55 peptide emulsified in 800 ug Mycobacterium tuberculosis- 
supplemented CFA (Difco Laboratories). On day 0 and day 2 post-immunization, 
200 ng pertussis toxin (List Biologicals) was administered intraperitoneally. Clinical 
disease was scored daily starting on day 8 until day 30 as follows: 0, no clinical dis- 
ease; 1, limp tail or hind limb weakness but not both; 2, limp tail and hind limb 
weakness; 3, partial hind limb paralysis; 4, complete hind limb paralysis; 5, mori- 
bund/death. 

For ROS neutralization, a cocktail of scavengers was used as described previously”. 
In brief, mice were administered intraperitoneally every 12 h with the aforemen- 
tioned cocktail or vehicle control starting from day 9 or day 10 post-immunization 
depending on the appearance of first clinical signs. 

All the mice used for EAE experiments shown in Fig. 3 were bone marrow chi- 
maeric mice. The chimaeric mice were generated when they were around 6 weeks 


old and 8—12 weeks, or even longer, was allowed for reconstitution. Therefore, 
mice were 14—20 weeks old at the time of immunization. Male mice were used for 
most of the experiments. Nrros'’"LysM-Cre?°’ and Nrros™"'LysM-Cre"® mice 
used for data shown in Extended Data Fig. 5 were 6-8 weeks old. 

A sample size of at least five mice per genotype was used in each independent 
experiment. No animals were excluded from any study. No blinding or randomi- 
zation method was used in grouping the mice. 

Measurement of oxidative damage in CNS tissues. Oxidative damage in the 
CNS of EAE-induced mice was assessed by measuring malondialdehyde levels”. 
Brain and spinal cords from EAE-induced mice were collected and homogenized 
in PBS. Protein estimation was done by BCA assay. Homogenates were used to 
measure the levels of MDA by ELISA using OxiSelect MDA adduct ELISA kit (Cell 
Biolabs) or by western blot using OxiSelect MDA immunoblot kit (Cell Biolabs) 
according to the manufacturer's recommendations. 

Statistical analysis. Statistical significance was calculated by Prism 5 software 
(Graphpad) using appropriate statistical test mentioned in the figure legends. For 
ROS measurements by luminol, area under curve (AUC) was calculated for each 
individually stimulated sample (at least three samples per group). The AUC values 
were then used in unpaired t-test with Welch’s corrections to generate a P value for 
each experiment. The average AUCs from different experiments were used in paired 
t-test assuming equal variance to generate P values when comparing multiple exper- 
iments. Any other appropriate statistical methods applied for a data set are described 
in the figure legends. A P value of less than 0.05 was considered as significant. 
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Extended Data Figure 1 | Murine NRROS is primarily expressed in immune 
tissues, especially in phagocytes. a, BMDMs were either untreated or treated 
with IFN-y, LPS or both. Heat map displays genes that were preferentially 
expressed in leukocytes and differentially regulated by the synergistic effects of 
both the treatments compared to either treatment alone. b, Analysis of Nrros 
expression in total RNA from different mouse tissues. Expression levels were 
normalized to Rpl19 (n = 4). c, Western blot analysis of NRROS protein 
expression in different mouse tissues. GAPDH used as loading control. 

d, Analysis of Nrros expression in total RNA from immune cells sorted from 
mouse spleen, normalized to Rpl19 (n = 3-5). e, Western blot analysis of 
NRROS protein expression in the indicated cells. HSP90 used as loading 
control. Mac, peritoneal macrophage; n.s., non-specific band observed only in 
lymphoid cells but not in macrophages with anti-NRROS antibody. 


f, Schematic of the strategy used to generate NRROS-deficient mice. g, Analysis 
of Nrros expression in total RNA from spleens of WT and KO mice (n = 5), 
normalized to Rpl19. h, Western blot analysis of NRROS protein expression in 
3T3 cells overexpressing NRROS (NRROS-3T3) and control cells (GFP-3T3) to 
screen anti- NRROS antibody. Actin used as loading control. i, Western blot 
analysis of NRROS protein expression in WT and KO BMDMs. GAPDH as 
loading control. j, Western blot analysis of NRROS protein expression in 
immune tissues from WT and KO mice. Actin used as loading control. 
Although there was no detectable specific band in KO BMDMs, a weak, 
nonspecific band was observed in lymphoid cells from both WT and KO mice 
that was present just above the specific NRROS band. Error bars, s.e.m. Data in 
c, e and j are representative of at least three independent experiments. 
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Extended Data Figure 2 | NRROS negatively regulates ROS production in 
various myeloid subsets. a—e, ROS production in thioglycollate-elicited 
peritoneal macrophages (a, b), neutrophils (c, d) and BMDM (e) stimulated 
with zymosan (zymo) (a, c) and phorbol 12-myristate 13-acetate (PMA) 

(b, d, e). Cells in a, b, e were primed with IFN-y before ROS induction. Left 
panels, ROS kinetic plots of one representative experiment with three 
independent samples per group. Right panels, AUC from at least three 
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independent experiments. f, ROS production in splenocytes from WT and KO 
mice (” = 3) stimulated with PMA in the presence of CM-H2DCFDA and 
analysed by FACS. MEI of CD11b* Gr-1'°"”"®S cells (left) and CD11b* Gr-1>8" 
(right) are shown. Error bars, s.e.m. *P < 0.05, **P < 0.01, ***P < 0.001. 
Unpaired Student’s t-test with Welch’s corrections (a-e, left), paired Student’s 
t-test (a-e, right), unpaired Student’s t-test (f). Data in f are representative of 
two independent experiments. 
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Extended Data Figure 3 | NRROS negatively regulates ROS production in 
human monocyte-derived macrophages (MDMs). a, b, Analysis of NRROS 
expression in total mRNA from human tissues (a) and cells purified from 
healthy donors (b) (n = 3). ¢, NRROS expression in MDMs stimulated as 
indicated (n = 3). d, NRROS expression in MDMs treated with non-targeting 
(Ctrl) or NRROS siRNA (siRNA). e, f, Kinetic graph of ROS production from 
cells shown in d. Cells were primed with IFN-y and were then either 


unstimulated or stimulated with zymosan (e) or PMA (f). g, h, ROS production 
by IFN-y-primed MDMs stimulated, in triplicate, with zymosan (g) and PMA 
(h) from three individual donors. Values at the peak production (60 min) 

are shown. Data in a—d are normalized to endogenous control RPL19. Error 
bars, s.e.m. *P < 0.05, **P < 0.01, ***P < 0.001. Unpaired Student’s t-test 
with (e, f) and without (g, h) Welch’s corrections. Data are representative of two 
independent experiments with three donors in each experiment. 
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Extended Data Figure 4 | NRROS regulates ROS production and 
bactericidal activities of phagocytes but not phagocytosis or nitric oxide 
production by them. a-c, ROS production in IFN-y-primed BMDMs 
stimulated with E. coli (a), S. typhimurium (b) and heat-killed L. monocytogenes 
(HKLM) (c). Averaged AUC from at least four independent experiments is 
plotted as bar graphs. d, In vitro E. coli killing by neutrophils purified from bone 
marrow of either WT or KO mice (n = 3). e, Nitric oxide (NO) in culture 
supernatant of BMDMs stimulated as indicated (n = 3). f, Representative 


histograms showing phagocytosis of serum-opsonized dsRed-E. coli by 
BMDMs. Thin line, untreated cells; thick line, treated cells. In the overlay 
histograms on the right: filled area, WT cells; thick line, NRROS-deficient cells. 
g, Left, NRROS expression in CD11b_” cells purified from spleens of uninfected 
WT and L. monocytogenes-infected WT and KO mice. Actin used as loading 
control. Right, densitometry analyses relative to actin. Error bars, s.e.m. 

**P < 0.01, ***P < 0.001. Paired (a—c) or unpaired (d, g) Student’s t-test. Data 
in d-g are representative of at least two independent experiments. 
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Extended Data Figure 5 | Myeloid-specific deletion of NRROS leads to 
exacerbated EAE. a, Schematic of the strategy used to generate Nrros mice. 
Numbers represent exons in Nrros gene. WT, Nrros“" LysM-Cre™’; KO, 
Nrros™" LysM-Cre?®. b, Western blot analysis of NRROS protein in immune 
cells from WT and KO mice. Actin used as loading control. Mac, macrophage; 
PEC, peritoneal macrophage. c-e, Clinical course (c), representative images 
of the spinal cord haematoxylin and eosin staining (d), and average 
histopathological lesion scores (day 30) (e) of EAE-induced WT (n = 20) and 
KO (n= 18) mice. Arrows in d indicate extensive myelinopathy of the lateral 
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white matter tracts. Luxol fast blue staining of myelin is in blue. f, g, Clinical 
course (f) and lesion severity as assessed by histopathology on day 30 (g) of 
EAE-induced WT and KO mice administered with either vehicle control 
(ctrl) (n = 12) or a cocktail of ROS scavengers (m = 13 for WT and n = 14 for 
KO) every 12h starting from day 9 (arrow) for the rest of the study. Error bars, 
s.e.m. *P < 0.05, ***P < 0.001. Two-way (c, f) or one-way (g) analysis of 
variance followed by Bonferroni’s post-hoc analysis, unpaired Student’s t-test 
(e). Data are representative of at least three (c-e) and two (b) independent 
experiments. Data in f, g are combined from two independent experiments. 
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Extended Data Figure 6 | NRROS regulates NOX2-mediated ROS but not 
mROS generation. a, b, mROS production by IFN-y-primed BMDMs from 
WT and NRROS-deficient (KO) mice stimulated with antimycin A (a) or 
rotenone (b) in the presence of MitoSOX. c-g, IFN-y-primed BMDMs were 
pre-treated for 1 h with antimycin A (c), DPI (d, e) or apocynin (f, g) before 
ROS induction by zymosan. (c, d, f). Left panels, ROS kinetic plots of a 
representative experiment with three independent samples per group; right 
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panels, averaged AUC from three independent experiments. e, g, Dose response 
of DPI (e) or apocynin (g) treatment in WT and KO cells. Values at the 
peak of the curve are used for the bar graph. Error bars, s.e.m. *P < 0.05, 
**P < 0.01. Unpaired Student’s t-test with Welch’s corrections (c, d, f, left), 
paired Student's t-test (c, d, f, right), unpaired Student's t-test (e, g). Data are 
representative of two (a, b, e, g) or three (c, d, f) independent experiments. 
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Extended Data Figure 7 | No difference between WT and NRROS-deficient 
BMDM:s in expression and activation of the cytosolic members of NADPH 
oxidase complex. a, Western blot analysis of p40P?°*, p47??™ and p67?>® in 
the lysates of BMDMs from WT and NRROS-deficient (KO) mice either 
unprimed (-) or primed (+) with IFN-y. Actin used as loading control. 

b, c, phospho-p47?"™ (P-p47?P**) and total p47?P** (b), active Rac and total 
Rac (c) in IFN-y-primed BMDMs stimulated with zymosan for the indicated 
time. Active Rac was immuoprecipitated and blotted. Total Rac in the lysates 
used as control. d, e, Taqman analysis of Cyba (p22PP°*) (d) and Cybb (NOX2) 
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(e) expression in total RNA from BMDMs either primed with IFN-y or left 
alone (Unprime). Expression levels were normalized to endogenous control, 
ribosomal protein L19 (Rpl19) (n = 3). £-h, Top panels, NOX2 and p22? 
expression in thioglycollate-elicited peritoneal macrophages (f), in splenocytes 
from mice infected with L. monocytogenes for 24h (g) and in total CNS cells 
from EAE-induced mice on day 15 post-immunization (h). Bottom panels, 
densitometry analyses relative to actin (nm = 3). Error bars, s.e.m. *P < 0.05, 
**P < 0.01 (unpaired Student’s t-test). Data are representative of three 

(a—c) and two (f-h) independent experiments. 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


b c d e 
~ es s > 
@2.0 215 O1.5)5 Sas 245 - So wt 
Bo e| rs i s = wp 
15 $ 10 810 B 1.0/5 os £10 mKO 
B40 ie o” te o 
fn 505 Y05 4 0.5 Sos 
one sv fe) g 
zoo MEBEEE 2 oo MABE zo. 0 MEEEEE S 0. oMEEEEEEE 9 0. oMBEEEEEE 
Ctrl 3 6 Cti 3! 6 Ctl 2 4 6 Ctl 2 4 6 Ctrl 2 4 6 
MG132 (h) MG132 (h) Chloroquine (h) Chloroquine (h) Chloroquine (h) 
g 
GFP-RAW Flag-NRROS-RAW_ Flag-NRROS-RAW 
WT KO Anti-Flag Isotype Anti-Flag 
EndoH -— + -— + a Craw] Jooox oaae [nnn nist 
1 Surface 
NOX2== 
Thee case) jon Poem [een sion 
aD 3 x 
pi] 7 a 
o | oe = Intracellular 
<x Ri ae . 
Me 2. ‘ —_* 
er 
GFP = Anti-Flag 


Calnexin 


Lamp1 


Extended Data Figure 8 | NRROS is an ER protein and regulates 
proteasome-mediated degradation of NOX2 and p22?™ proteins. 
a-e, Densitometry analyses relative to actin of NOX2 (a, c), p22Phox (b, d) and 
p62 (e) in IFN-y-primed BMDMs from WT and KO cells treated with MG132 
(a, b) or chloroquine (c-e) for the indicated period (n = 3). p62 is shown as 
positive control for chloroquine treatment. f, Autoradiography showing 
immunoprecipitated NOX2 from BMDMs metabolically labelled with 
[°°S]methionine for 1 h. The immunoprecipitates were subjected to Endo H 
treatment for 18h. Total proteins in the flow-through lysates is shown as 
loading control. g, RAW cells retrovirally transduced to stably express GFP 
(GFP-RAW) or Flag-NRROS (Flag-NRROS-RAW) were stained with either 
isotype control (second column) or anti-Flag antibody (first and third 


Merge 


Merge 


columns). Cells were either non-permeabilized (top panels) or permeabilized 
(bottom panels). Overlay of the three populations is shown on the right. Thick 
line, Flag-NRROS-RAW cells stained with anti-Flag; filled area, GFP-RAW 
cells stained with anti-Flag; dashed line, Flag-NRROS-RAW cells stained with 
isotype control. h, RAW cells stably expressing Flag-tagged NRROS were 
stained with anti-Flag antibody (Flag) (green) and one of the indicated 
organelle markers (red), calnexin (top row) and Lamp1 (bottom row) and 
imaged by confocal microscopy. Colocalization of the two colours is depicted 
by yellow colour in the merged images on the right column. Error bars, 
sem. *P < 0.05 (unpaired Student’s t-test). Data are representative of at least 


three independent experiments. 
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Extended Data Figure 9 | NRROS does not regulate NOX2 through the 
HSP90-HSP70-CHIP pathway. a, Western blot analysis of HSP90, HSP70 
and CHIP in total lysates from IFN-y-primed WT and KO BMDMs. Actin used 
as loading control. b, c, Zymosan-induced ROS production by IFN-y-primed 
BMDMs treated with the HSP90 inhibitor radicicol at 5 uM (b) and 20 UM (c). 


d, Left, NOX2, HSP90 and HSP70 expression in IFN-y-primed BMDMs treated 
with the indicated dose of radicicol. Actin used as loading control. Right, 
densitometry analyses relative to actin from three experiments. Error bars, 
s.e.m. *P < 0.05, ***P < 0.001. Unpaired Student’s t-test with (b, c) and 
without (d) Welch’s corrections. Data in a are representative of three 


A representative plot of kinetics of ROS production (three independently experiments. 


stimulated samples per group) from three independent experiments is shown. 
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Extended Data Table 1 | Analysis of immune cell subsets in lymph node, spleen, thymus and blood of WT and KO mice 


Lymph node WT (n=5) KO (n=5) 
CD4° 64.1 + 1.00 65.8 + 0.62 
CDs" 25.6 + 0.67 24.5+0.32 
B220° 14.9 + 0.86 15.6+1.14 
CD4°CD25" 2.5+0.10 2.7+0.13 
Spleen WT (n=5) KO (n=5) 
CD4" 43.0 + 0.98 45.1 + 0.66 
CDs" 24.0 + 0.68 21.0 + 0.86 
B220° 42.9 + 0.40 42.9 + 1.85 
y6 T cells 0.5 + 0.05 0.5 +0.05 
DX5" (NK cells) 4.0 + 0.25 4.02 + 0.13 
CD11b*F4/80° 4.1+0.22 4.3 40.07 
CD11b*CD1 1c" 0.5+0.39 0.5 + 0.03 
CD11b’ Gr-1° 3.2+ 0.34 3.3+0.11 


Thymus WT (n=5) KO (n=5) 


CcD4"CD8" 84.0+ 1.14 84.3 + 0.92 
CD4CD8& 6.7 + 0.78 7.140.59 
CD4*°CDs" 4.6 + 0.43 4.2+0.11 
CD4CD8" 4.4 + 0.38 4.8 + 0.44 
Complete Blood Count (CBC) WT (n=5) KO (n=5) 
White blood cells 27 = 167 10.0 + 1.19 
Neutrophils 9.7 2182 6.140.95 
Lymphocytes $1.32 171 81.6 + 2.87 
Monocytes 6.5 + 0.75 L302 
Eosinophils 1.1+0.29 0.6 + 0.35 
Basophils 1.4+0.17 0.7 + 0.44 


Single-cell suspensions of lymph node, spleen and thymus were stained with the indicated surface marker and analysed by FACS. Percentages of cells in the total population are shown. Haematological analysis of 
the blood (complete blood count) showing the percentage of total white blood cells and the percentages of different leukocyte populations therein is shown. All the values are average + s.e.m. 
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Endosomes are specialized platforms for bacterial 
sensing and NOD2 signalling 


Norihiro Nakamura', Jennie R. Lill', Qui Phung’, Zhaoshi J iang', Corey Bakalarski’, Ann de Mazieére’, Judith Klumperman/?, 


Megan Schlatter’, Lélia Delamarre’ & Ira Mellman! 


The detection of microbial pathogens involves the recognition of 
conserved microbial components by host cell sensors such as Toll- 
like receptors (TLRs) and NOD-like receptors (NLRs). TLRs are 
membrane receptors that survey the extracellular environment for 
microbial infections, whereas NLRs are cytosolic complexes that 
detect microbial products that reach the cytosol. Upon detection, 
both sensor classes trigger innate inflammatory responses and allow 
the engagement of adaptive immunity'”. Endo-lysosomes are the 
entry sites for a variety of pathogens, and therefore the sites at which 
the immune system first senses their presence. Pathogens interna- 
lized by endocytosis are well known to activate TLRs 3 and 7-9 that 
are localized to endocytic compartments and detect ligands present 
in the endosomal lumen’. Internalized pathogens also activate sen- 
sors in the cytosol such as NOD1 and NOD2 (ref. 2), indicating that 
endosomes also provide for the translocation of bacterial compo- 
nents across the endosomal membrane. Despite the fact that NOD2 
is well understood to have a key role in regulating innate immune 
responses and that mutations at the NOD2 locus are a common risk 
factor in inflammatory bowel disease and possibly other chronic 
inflammatory states*”, little is known about how its ligands escape 
from endosomes. Here we show that two endo-lysosomal peptide 
transporters, SLC15A3 and SLC15A4, are preferentially expressed 
by dendritic cells, especially after TLR stimulation. The transporters 
mediate the egress of bacterially derived components, such as the 
NOD2 cognate ligand muramyl dipeptide (MDP)*”’, and are selec- 
tively required for NOD2 responses to endosomally derived MDP. 
Enhanced expression of the transporters also generates endosomal 
membrane tubules characteristic of dendritic cells, which further 
enhanced the NOD2-dependent response to MDP. Finally, sensing 
required the recruitment of NOD2 and its effector kinase RIPK2 
(refs 8, 9) to the endosomal membrane, possibly by forming a com- 
plex with SLC15A3 or SLC15A4. Thus, dendritic cell endosomes are 
specialized platforms for both the lumenal and cytosolic sensing of 
pathogens. 

Although cytosolic NLRs monitor the presence of bacterial patho- 
gens, only a minority of pathogenic or commensal bacteria reaches the 
cytosol, raising the question of how NLR ligands escape from endosomes 
after bacterial internalization. To determine if entry was required, we 
transfected 293 cells with NOD2 and an NF-«B luciferase reporter, and 
cultured them with invasive Salmonella typhimurium, well known to 
activate the NOD2 pathway. As shown in Fig. la, wild-type S. typhi- 
murium strongly induced NF-«B even at low ratios of bacteria to 293 
cells. Activation of NOD2 was dependent on host cell entry, because an 
invasion-defective S. typhimurium mutant (AinvG)"° did not induce NF- 
«B (blue circles), similarly to a non-invasive bacterium (Escherichia coli; 
green circles). The failure of the dinvG mutant to trigger NOD2 sig- 
nalling indeed reflected a failure of entry. First, blocking internalization 
of wild-type S. typhimurium by cytochalasin D also inhibited NOD2 
activation (Fig. 1b), and without affecting signalling by soluble MDP 
added extracellularly (Extended Data Fig. 1a). Second, when the AinvG 
entry defect was complemented by expressing Yersinia invasin, which 


triggers phagocytic entry by binding to {1 integrins on target cells’’, 
NOD2 activation was completely restored (Extended Data Fig. 1b). After 
internalization, wild-type S. typhimurium was found within LAMP1- 
positive phagolysosomes (Extended Data Fig. 1c), indicating that ligands 
such as MDP do not gain access to NOD2 by phagosome disruption. 
Indeed, when the phagosome disruption occurs (as in the sifA mutant’’), 
there is a marked activation of NOD2 even at very low bacterial expo- 
sures (Extended Data Fig. 1d-f). Although these observations indicate 
the involvement ofa transporter under normal conditions, the relevant 
transporter was not bacterially encoded. NOD2 activation did not require 
the S. typhimurium effector proteins SipA and SopE (Extended Data 
Fig. 1g), recently suggested to activate NOD2 and MAP-kinase after trans- 
location to the cytosol via the bacterial type III secretion system’*"*. More 
directly, phagocytosed MDP-coated 1-1m latex beads were found to 
trigger NOD2-dependent activation of NF-«B even in the absence of 
bacteria (Fig. 1c). Cytochalasin D as well as acidification inhibitors blocked 
NF-«B activation and/or internalization of the beads (Fig. 1c, d and 
Extended Data Fig. 1h), strongly implying the involvement of endo- 
somal transporters for MDP. 

To characterize candidate transporters in endo-lysosomes, we per- 
formed quantitative SILAC (stable isotope labelling by amino acids in 
cell culture) proteomic analysis of endocytic compartments in dendritic 
cells. An enriched endo-lysosomal fraction was prepared by magnetic 
bead isolation after internalization of 200-nm iron oxide particles by 
isotopically labelled bone marrow-derived dendritic cells (BMDCs) + 
lipopolysaccharide (LPS) stimulation (Extended Data Fig. 2a—c). Out 
of 1,436 proteins identified in the endo-lysosome fractions, 8 and 45 
were annotated as ATP-binding cassette (ABC) transporters and solute 
carrier family (SLC) transporters, respectively, in the UniProt database 
(Supplementary Table 1). Of these, five (ABCA1, ABCA3, SLC3A2, SLC7A2 
and SLC15A3) were upregulated more than twofold by LPS treatment 
(Extended Data Fig. 2d), and showed a predominant localization to 
endocytic compartments (Extended Data Fig. 3a). The others were largely 
restricted to sites such as the endoplasmic reticulum (TAP1 and TAP2) 
or the plasma membrane (SLC7A11). Quantitative PCR revealed that 
only one candidate, SLC15A3, was expressed predominantly by den- 
dritic cells. SLCI5A3 expression at the messenger RNA level increased 
2.5-fold following LPS treatment of mouse BMDCs; it was also detected 
ina macrophage cell line (J774.2) (Fig. le, upper panel). In human cells, 
SLC15A3 was expressed by myeloid dendritic cells as well as by plas- 
macytoid dendritic cells (Fig. le, lower panel), but was not detected in 
other human or mouse cell types. SLC15A3 is a member of the proton- 
coupled peptide symporter family consisting of four genes, SLC15A 1 to 
SLCI5A4 (ref. 15). SLC15A4, a closely related family member, was also 
found in the endo-lysosome fraction albeit at lower induction by LPS 
treatment (Extended Data Fig. 2d). SLC15A3 and SLC15A4 were clearly 
localized to endosomes, lysosomes and endo-lysosomal tubules in BMDCs 
(Fig. 1g-i). Although SLC15A4 was not upregulated by LPS, it was con- 
stitutively highly expressed in plasmacytoid dendritic cells; it was also 
expressed at low levels in the 293 cells used for our reporter assays (Fig. 1f). 
The two other members of the family, SLC15A1 (also known as PepT1) 
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Figure 1 | Recognition of MDP from Salmonella and MDP-beads 
internalized into SLC15A3* and SLC15A4* endosomes. a, NE-«B- 
luciferase reporter assay of 293 cells transfected with an NF-«B-luciferase and 
either NOD2 or vector. Cells were cultured with S. typhimurium SL1344 

(S.t. WT), invG mutant (AinvG), or E. coli BL21 (E.c.) for 1h, then incubated 
overnight in fresh medium containing gentamycin. b, NOD2-transfected 293 
cells were pre-incubated with or without cytochalasin D, then stimulated 
with S. typhimurium. c, NOD2- or vector-transfected cells were incubated with 
MDP-beads or uncoated beads (same amount to MDP-beads at 100 ng 

MDP ni’) with or without cytochalasin D. All data are presented as fold 
relative to unstimulated NOD2-transfected cells. Without NOD2 cDNA, 
NF-«B activation was negligible even in the presence of MDP. d, NOD2- 
transfected cells were pre-incubated with bafilomycin A1 or chloroquine. 

e, f, Expression of SLC15A3 (e) and SLC15A4 (f) in mouse and human cells was 
quantified by RT-PCR. (+) denotes LPS-stimulated dendritic cells. mDC, 
myeloid dendritic cells; pDC, plasmacytoid dendritic cells. g-i, Microscopy of 
BMDCs expressing SLC15A3-RFP and LAMP1-GFP (g), SLC15A3-GFP 

(h) or SLC15A4-GFP (i). Endo-lysosomes are labelled with dextran—Alexa647 
or OVA-Alexa555. Single and double arrows mark vesicular and tubular 
endo-lysosomes. Bars, 5 jum. Unless noted otherwise data represent the mean of 
triplicates + s.d. The data are representative of three independent experiments. 


and SLC15A2 (also known as PepT2) localized to the plasma membrane, 
were not specific to dendritic cells, and were not found in the endo- 
lysosome fraction (Extended Data Fig. 3b, cand Supplementary Table 1). 

SLC15A3 and SLC15A4 belong to a transporter family that co-transports 
di- or tri-peptides and protons across membrane using a proton gradient 
as the driving force'’. Endosomes and lysosomes are acidic'*”’, suggest- 
ing that SLC15A3 and SLC15A4 transport small peptides to the cytosol 
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using the outward-directed proton gradient. We first asked if the expres- 
sion of SLC15A3 enhanced the release of endocytosed MDP conju- 
gated to rhodamine from endosomes. MDP-rhodamine was loaded 
into endosomes by endocytosis in cells expressing green-fluorescent- 
protein-conjugated SLC15A3-GFP or LAMP1-GFP (Fig. 2a). The rho- 
damine fluorescence in SLC15A3-GFP* endosomes decreased more 
rapidly (Fig. 2b). We next expressed mutant transporters that localized 
to the plasma membrane following disruption of cytoplasmic di-leucine 
motifs'* required for endo-lysosomal localization (Fig. 2c and Extended 
Data Fig. 4a, b). When cells were incubated with MDP-rhodamine in 
acidic medium (pH 5.5), accumulation of MDP-rhodamine was sig- 
nificantly enhanced in cells expressing plasma membrane mutants of 
SLC15A3 (Fig. 2d). Only a slight increase in endocytosis was observed 
in cells expressing endosomal SLC15A3. Consistently, cells expressing 
the plasma membrane mutant were hyper-responsive to extracellular 
soluble MDP (Extended Data Fig. 4c, see also below). Thus, expression 
of transporter at cell surface increased uptake of free MDP from the 
culture medium, consistent with a directionality for transport of lumi- 
nal MDP in endo-lysosomes to cytosol by wild-type transporters. 

Transfection ofa complementary DNA encoding wild-type SLC15A3 
or SLC15A4 enhanced the NF-«B-luciferase signal induced by S. typhi- 
murium or MDP-coated beads (Fig. 2e). Similar results were obtained 
using endogenous cytokine release (IL-6, TNF-«) as indicators of NOD2 
signalling in Raw 264.7 macrophages, which express low levels of endo- 
genous SLC15A3 and SLC15A4 (Fig. 2f and Extended Data Fig. 5a). 
SLC15A3 and SLC15A4 did not facilitate TLR and TNF-« signalling in 
293 cells stimulated with flagellin and TNF-a (Extended Data Fig. 5b) 
or Raw 264.7 cells stimulated with LPS (Extended Data Fig. 5a), con- 
sistent with the specificity of the transporters for NOD2 ligands. Next, 
we generated BMDCs from a mouse strain deficient in SLC15A3 and 
assayed the ability of MDP-beads to elicit the cytokine release. When 
wild-type dendritic cells were incubated with MDP-beads, IL-6 was pro- 
duced in a dose-dependent manner (Fig. 2g, closed bars). However, the 
production of IL-6 by SLC15A3-deficient dendritic cells was substan- 
tially lower than wild-type cells (open bars). Similarly, the production 
of IL-1B was also substantially lower in SLC15A3-deficient dendritic 
cells (Extended Data Fig. 5c). These results indicate that SLC15A3 facil- 
itates the recognition of internalized MDP. SLC15A3-deficient BMDCs 
took up similar numbers of MDP-beads compared to wild-type, indi- 
cating that endocytic uptake was unaffected by the absence of SLC15A3 
(Extended Data Fig. 5d). LPS- or Salmonella-induced production of 
IL-6 was also unaffected in SLC15A3-deficient dendritic cells (Extended 
Data Fig. 5e). The residual signalling seen in the knockout dendritic 
cells probably reflected the residual expression of SLC15A4 in these cells. 
The SLC15A3-deficient mice themselves exhibited no overt phenotype, 
at least under steady state conditions. 

One of the most striking features of dendritic cell endo-lysosomes is 
their ability to extend long tubules following LPS treatment’*’. Indeed, 
endo-lysosomes isolated from LPS-stimulated BMDCs contained numer- 
ous tubular structures (Extended Data Fig. 2b, arrows). Because SLC15A3 
was strongly upregulated upon LPS stimulation, we asked if these two 
events were related. Remarkably, cells transfected with SLC15A3 exhib- 
ited abundant tubular structures compared to untransfected cells (Fig. 3a). 
Expression of SLC15A4 also generated tubules (Fig. 3b), although other 
endo-lysosomal transporters did not (see Extended Data Fig. 3a). Tubule 
induction was efficient, with 53% and 47% of 293 cells expressing SLC15A3 
and SLC15A4 exhibiting tubules (Fig. 3c). Conversely, in BMDCs from 
Sle15a3‘~ mice, fewer than 30% of total cells exhibited tubules after 
LPS treatment, whereas ~80% of wild-type BMDCs exhibited tubular 
endo-lysosomes (Fig. 3e). The reduced tubulation was rescued by express- 
ing SLC15A3-GFP, but not by the GFP empty vector (Fig. 3d). Taken 
together, these gain and loss-of-function experiments provide strong 
evidence that the induction of SLC15A3 and SLC15A4 in response to 
stimuli such as LPS results in the formation of endo-lysosomal tubules. 
We asked if tubulation itself might have significance for NOD2 sens- 
ing. Because the extensions tracked along cytoplasmic microtubules 
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Figure 2 | SLC15A3 and SLC15<A4 facilitate the 
recognition of phagosomal MDP. a, b, 293 cells 
expressing SLC15A3-GFP (a, b) or LAMP1-GFP 
(b) were incubated with MDP-rhodamine, then 
chased in fresh medium. Rhodamine in GFP- 
positive endosomes (arrows) were quantified 

by fluorescence microscope and Image] (b). 
Means + s.d. **Two regression slopes differ 
significantly (P = 0.006, n = 20 endosomes; t-test). 
Bars, 5 um. ¢, Microscopy of di-leucine mutants of 
human (left) or mouse (right) SLC15A3-GFP in 
293 cells. Arrows indicate cell surface. d, Cells 
transfected with SLC15A3, di-leucine mutants 

or vector were incubated with soluble MDP- 
rhodamine in medium (pH 5.5) for 30 min, then 
the uptake was quantified by a fluorophotometre. 
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(Extended Data Fig. 6), it was not surprising that nocodazole treatment, 
which inhibits the polymerization of microtubules, collapsed the tubular 
network (Fig. 3f). Interestingly, nocodazole also reduced the activation 
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e, 293 cells transfected with an NF-«B-luciferase, 
NOD2, and either SLC15A3, SLC15A4 or vector, 
are incubated with S. typhimurium (multiplicity of 
infection = 10) or MDP-beads (1 bg MDP ml7'). 
White bars represent results for the same 
conditions but with the addition of cytochalasin D. 
f, Raw 264.7 cells stably expressing SLC15A3 or 
SLC15A4 were stimulated with MDP plus 

10ng ml | LPS for 24h. g, SLC15A3-deficient or 
wild-type BMDCs were incubated with MDP- 
beads. IL-6 in medium was quantified by ELISA. 
Means = s.d. *P < 0.05 and **P < 0.01 by t-test. 
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of NF-«B enhanced by SLC15A3 in cells simulated with Salmonella or 
MDP-beads (Fig. 3g, SLC15A3), although this enhancement did not 
occur in cells where tubules were not present, (Fig. 3g, vector). Thus, 
endo-lysosomal tubules generated by transporter expression further 
enhance MDP escape or detection, perhaps by increasing the surface 
area of the compartment”””’. 

We next determined that there was a spatial relationship between 
NOD2 and bacteria-containing phagosomes. Because endogenous NOD2 
is expressed at low levels, 293 cells expressing red-fluorescent-protein- 
conjugated RFP-NOD2 were used for these experiments, the fusion 
protein having been found functional for NF-«B signalling (see Extended 
Data Fig. 7e). NOD2 was associated with Salmonella-containing pha- 
gosomes (Fig. 4a, b), whose membranes were also positive for LAMP1 
and SLC15A4 (Fig. 4c and Extended Data Fig. 7a). Additional pools of 
NOD2 were found associated with the plasma membrane and diffusely 
distributed in the cytosol (Fig. 4a), consistent with earlier work that sug- 
gested NOD1 recruitment at sites of Shigella flexneri attachment. NOD1, 
which recognizes y-D-glutamyl- diaminopimelic acid in peptidoglycan**”*, 
was also found on Salmonella-containing phagosomes (Extended Data 
Fig. 7b). Association of NOD2 with the plasma membrane requires a 


Figure 3 | SLC15A3 and SLC15A4 generate endo-lysosomal tubules. 

a, b, Microscopy of 293 cells expressing SLC15A3-GFP or SLC15A4-GFP. 
Arrows mark tubules with OVA-Alexa555. c, Frequencies of cells with 
endo-lysosomal tubules in the transfected 293 cells. Means = s.d. (n = 80 cells 
in triplicates). n.d., not detected. d, Microscopy of wild-type or SLC15A3- 
deficient BMDCs stimulated with LPS. GFP or SLC15A3-GFP was expressed in 
SLC15A3 knockout BMDCs. e, Frequencies of BMDCs with tubules. For 
SLC15A3-deficient cells, percentages were calculated as a function of 
GFP-positive cells. f, 293 cell expressing SLC15A3-GFP was imaged before and 
after 10 UM nocodazole treatment (17 min). Arrows indicate tubules. g, 293 
cells transfected with NOD2 and either SLC15A3 or vector, were pre-incubated 
with or without nocodazole, then stimulated with S. typhimurium or 
MDP-beads. Means = s.d. *P < 0.05 and **P < 0.01. Bars, 10 jim (a, b, d) and 
5 um (f). 
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leucine-rich repeat domain found at the NOD2 carboxy terminus”. A 
frameshift mutation (3020insC) that causes a truncation of the leucine- 
rich domain is a risk factor for inflammatory bowel disease*”*, suggest- 
ing that the membrane association of NOD2 is important for pathogen 
sensing. Indeed, the 3020insC variant was not associated with Salmonella- 
containing phagosomes, but was largely found in the cytosol (Fig. 4d). 
These results were confirmed by immunoelectron microscopy (Fig. 4b). 
In cells expressing RFP-NOD2, 0.88 gold particles per phagosome were 
detected whereas in cells expressing mutant NOD2-3020insC, only 0.22 
gold particles per phagosome were found (t-test, P = 0.001); the resid- 
ual association in mutant-expressing cells probably reflected the phys- 
ical proximity of unbound soluble NOD2-3020insC. Cell fractionation 
showed that full-length NOD2 co-fractionated with the late endosomal- 
lysosomal marker LAMP 1 or LAMP2: ~20% of RFP- or Flag-NOD2 
was found in the phagosome fraction of 293 and BMDCs, compared 
with only 4% of tagged NOD2-3020insC (Extended Data Fig. 7c). Con- 
sistently, NOD2 was associated with the LAMP2-positive phagosomes 
that contain internalized MDP-beads in BMDCs (Extended Data Fig. 7d). 
Importantly, cells expressing mutant NOD2-3020insC were largely defi- 
cient at NF-«B activation in response to S. typhimurium entry (Extended 
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Data Fig. 7e). Thus, the association of NOD2 with pathogen-containing 
endocytic compartments is an important feature of MDP sensing. 
We next asked if NOD2 might be in a complex with SLC15A3 or 
SLC15A4. Indeed, NOD2 was detected in immunoprecipitates of both 
SLC15A3 and SLC15A4 (Fig. 4e, left); similarly, SLC15A3 and SLC15A4 
could also be detected in NOD2 immunoprecipitates. Importantly, the 
abundant endo-lysosomal protein LAMP! was not associated with NOD2, 
providing support for the specificity of the SLC15-NOD2 association 
(Fig. 4e, right). Consistent with localization of NOD2-3020insC vari- 
ant in the cytosol (Fig. 4d), NOD2-3020insC protein was greatly reduced 
in the immunoprecipitates of SLC15A3 and SLC15A4 (Fig. 4e, left). 
SLC15A3 and SLC15A4 co-immunoprecipitated SLC15A4 and SLC15A3, 
respectively (Fig. 4f), indicating that SLC15A3 and SLC15A4 can form 
heterodimers with each other. Finally, NOD2 binds an effector protein 
kinase RIPK2, which is essential for activating downstream signalling 
via NF-«B and microRNA-29*””*, When RFP-NOD2 was expressed 
with GFP-RIPK2 in 293 cells, the RIPK2 was found in endo-lysosomes 
that contained NOD2 (Fig. 4g). In the absence of NOD2 expression, 
GFP-RIPK2 was low or absent in endosomes (Extended Data Fig. 7f), 
indicating that RIPK2 is recruited to endosomes by NOD2, suggesting 


Figure 4 | NOD2 forms a sensor complex on 
endosomes. a-d, Fluorescence microscopy 

(a, c, d) or immuno-electron microscopy (b) of 
NOD2? in Salmonella-infected 293 cells. Cells stably 
expressing RFP-NOD2 (a-c) or RFP-NOD2- 
3020insC (d) and SLC15A4—GFP (c) were 
incubated with S. typhimurium expressing GFP 
(a, b, d) or tagBFP (c) as in Fig. 1. RFP-NOD2 

in b was identified by anti-RFP antibody 

and protein A-gold (10 nm). e, f, Co- 
immunoprecipitation of NOD2, SLC15A3 and 
SLC15A4. Tagged proteins were expressed in 

293 cells, isolated by anti-GFP (e) or anti-Flag 

(f) beads (immunoprecipitation, IP), then 
immunoblotted (IB). g, Fluorescence microscopy 
of 293 cells expressing either GFP-RIPK2 or 
RFP-NOD2. Endo-lysosomes are labelled with 
dextran-Alexa647 (arrowheads). Asterisk (*) 
indicates a cell expressing RFP-NOD2 but not 
GFP-RIPK2. Bars, 5 jim (a, ¢, d, g) and 0.2 um (b). 
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that membrane transporters, NOD2 and the effector protein kinase 
contribute to the assembly of the signalling complex on endosomes. 

Our genetic and functional analysis has demonstrated that MDP trans- 
port from endosomes and lysosomes was dependent on two dendritic- 
cell- and macrophage-specific endo-lysosomal transporters, SLC15A3 
and SLC15A4. Importantly, MDP sensing also required the recruitment 
of NOD1 and NOD2, and accordingly RIPK2, to transporter-containing 
compartments. Thus, endo-lysosomal membranes in dendritic cells (and 
macrophages) are not only specialized to enable the selective escape of 
key bacterial components, but also to serve as platforms that assemble 
critical microbial sensors and downstream effectors. Together with the 
fact that the transporters induce a marked tubulation of endosomal 
compartments to enhance the sensitivity of MDP detection, our results 
establish the conceptual importance of endosomes as signalling plat- 
forms specialized for triggering innate immune responses. 


METHODS SUMMARY 


MDP-beads were prepared by mixing tosyl-activated Dynal MyOne beads (Invi- 
trogen) and MDP-rhodamine B (InvivoGen). For reporter assay, 293 cells (40 X 10°) 
stably transfected with pNIFTY2-Luc and pUNO-hNOD2a (InvivoGen) were used. 
For co-expression with SLC15 transporters, 293 cells were transiently transfected 
with pNIFTY2-Luc, pUNO-hNOD2a, and either plasmid of EGFP, SLC15A3-EGFP, 
or SLC15A4-EGFP using Lipofectamine 2000. Cells were incubated with MDP or 
MDP-beads for 7 h, or S. typhimurium (SL1344) or E. coli (BL21) for 1h. The cells 
incubated with bacteria were washed with fresh medium, then cultured in the pre- 
sence of 20 jig ml * gentamycin for 24h. Luciferase activity was expressed as fold 
activation relative to the activity of unstimulated NOD2-transfected cells. BMDCs 
were differentiated from bone marrow using GM-CSF. Cytokine secretion from 
BMDCs (200 X 10°) or Raw 264.7 (100 X 10°) were measured by ELISA. The amount 
of beads in each culture was adjusted by adding non-coated beads. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 

Materials. MDP and E. coli LPS were obtained from InvivoGen and Sigma. MDP- 
beads were prepared by mixing tosyl-activated Dynal MyOne beads (Invitrogen) 
and MDP-rhodamine B (InvivoGen). The beads absorbed 4.4 fg MDP-rhodamine 
per bead. Antibodies were purchased from following companies: LAMP 1 (25) and 
syntaxin-6 (30), BD Bioscience; Na‘/K*-ATPase «1 subunit (C464.6), Upstate; 
transferrin receptor (H68.4), Zymed; GAPDH (6C5), Ambion; Flag-tag (M2), Sigma. 
Polyclonal antibodies against GFP (632459) and tagRFP (AB233) were obtained from 
Clontech and Evrogen. Anti-MHCII IAb polyclonal and anti-LAMP-2 monoclonal 
(clone GL2A7) antibodies are described previously”. 

Animals and cell culture. Research involving animals complied with protocols 
approved by the Genentech Institutional Animal Care and Use Committee. BMDCs 
were differentiated from bone marrow of 7-12 weeks old mice using GM-CSF 
(peprotech)*". BMDCs expressing MHCII-GFP were prepared from MHCII-GFP 
knock-in mice’. Human dendritic cells were obtained from blood”. All proce- 
dures used in this study complied with federal guidelines and institutional policies 
of the Genentech Animal Care and Use Committee. Gene expression was analysed 
using TaqMan RT-PCR: SLC15A1 (assay ID: Hs00192639_m1), SLC15A2 (assay ID: 
Hs00221539_m1), SLC15A3 (assay IDs: Hs00275455_m1 and Mm00491666_m1), 
SLC15A4 (assay IDs: Hs00377326_m1 and Mm00505709_m1), and GAPDH (assay 
IDs: Hs99999905_m1 and Mm99999915_g1) were obtained from Applied Biosys- 
tems. S. typhimurium and E. coli were cultured in LB at 37 °C. Overnight culture of 
bacteria was diluted 20-fold in LB containing 0.3 M NaCl, and incubated for 2h 
before use. InvG, SifA, SipA and SopE genes were disrupted using a A-Red plasmid 
pSIM18*. Targeting cassettes that comprise kanamycin-resistance gene flanked 
by LoxP sites and homology arms, were generated by PCR using pBlight-TK™ and 
the following primers: InvG: targeting vector, 5’-agtgagatcaaagagctgatcggcgtttcgc 
cgcggaaattatcaaatattattcaattggcagacaaatgaataacttcgtataatgtatgctatacgaagttatctgttg 
acaattaatcatcggcat-3' and 5’-gatgagatgtctgccgggacaatattctggaaaatgaaataccggagett 
gagccaggaatcatttaattgcataacttcgtatagcatacattatacgaagttattcagaagaactcgtcaagaaggcg-3’; 
genotyping, 5’-cacccaattagccgttaatcatge-3’ and 5’-ggcctgttgtatttccgcctgttg-3’. SifA: 
targeting vector, 5’-cgcagttgagataaaaagggtcgatttaatcaattatgtagtcatttttactccagtata 
agtgagattaatataacttcgtataatgtatgctatacgaagttatctgttgacaattaatcatcggcat-3' and 5’- 
actcagtaggcaaacaggaagtacetgagtaaaccctgaacetgacetctgagaaagcetcetctgattttaataact 
tcgtatagcatacattatacgaagttattcagaagaactcgtcaagaagecg-3'; genotyping, 5’-tgcgttta 
tacgcgaagctctcag-3’ and 5’ -gactattctcatccgatccggtca-3’. SopE: targeting vector, 5’ - 
tgctataaaaaataaaagcattcctgctatctatatataaatgaattatgtacatataaaaggatcattaccataacttcg 
tataatgtatgctatacgaagttatctgttgacaattaatcatcggcat-3' and 5'-gggeaaattatggatactatt 
ggttcatattaatcaggaagagectccgcatattttttgetttttcagtgtataacttcgtatagcatacattatacgaagt 
tattcagaagaactcgtcaagaaggcg-3'; genotyping, 5’-attgaatcagccgaatttttctaat-3’ and 
5'-gacggtttagctccggagttagttc-3'. SipA: targeting vector, 5'-agcagtacgataagtagcagc 
ctggaaaccgccaaaagcttcctgcaaggataacagaagageatattaataataacttcgtataatgtatgctatacga 
agttatcgcggaacccctatttgttt-3’ and 5’-aatacaagagetgatcacttttttgactcttgcttcaatatccat 
attcatcgcatctttcccggttaattaataacttcgtatagcatacattatacgaagttatggtctgacagttaccaatgc-3’; 
genotyping, 5’-aaaaatatagcaatgccaattcattgtacg-3' and 5’-agatcctcaaccagatgggtctg 
ac-3’. The kanamycin-resistance gene was removed from the chromosome by trans- 
forming 705-Cre (Gene Bridges). Gene disruption was confirmed by genomic PCR. 
A low copy plasmid, pSTV29 (Takara) was used to express EGFP, tagBFP, SifA and 
invasin gene of Y. pseudotuberculosis (NCTC 10275). SifA and invasin genes were 
amplified from genomic DNAs of S. typhimurium and Y. pseudotuberculosis, respect- 
ively, by PCR using primers: SifA, 5’-gggagatctacacgcatccaggcatgaagtttattc-3' and 
5'-gggeaattcttataaaaaacaacataaacagccgctttg-3’; Invasin, 5’-gggegatcccgtatcatgaac 
ggatacccagcac-3’ and 5'-gecggatccaagectggetattgc-3’. Bacteria were transformed 
using Gene Pulser (Bio-Rad). 

Reporter assay. 293 cells (40 X 10°) stably transfected with pNIFTY2-Luc and pUNO- 
hNOD2a (InvivoGen) were used for the reporter assay. For co-expression with SLC15 
transporters, 293 cells were transiently transfected with pNIFTY2-Luc (25 ng), 
pUNO-hNOD2a (1 ng), and either plasmid of EGFP, SLC15A3-EGEP, or SLC15A4- 
EGFP (1.8 ng) using Lipofectamine 2000. Cells were incubated with MDP or MDP- 
beads for 7 h, or S. typhimurium or E. coli for 1h. Cells incubated with bacteria were 
washed with fresh medium, then cultured in the presence of 20 pgml~’ gentamycin 
for 24h. Cytochalasin D (10 uM, 1h), nocodazole (10 11M, 30 min), bafilomycin Al 
(1 uM, 30 min) or chloroquine (10 1M, 30 min) was added before the stimulation. 
Luciferase activity was measured by Dual-Glo luciferase assay kit (Promega), then 
expressed as fold activation relative to the activity of unstimulated NOD2-transfected 
cells. Production of IL-6, IL-18, and TNF-« from BMDCs (200 X 10°) or Raw 264.7 
(100 X 10°) were measured by ELISA (R&D systems). The amount of beads in each 
culture was adjusted by adding non-coated beads. 

Mass spectrometry of endo-lysosomes. SILAC was performed as described previ- 
ously*>~’, BMDCs were cultured in heavy lysine-'?C,,!°N2 and arginine- BCPNa 
or light lysine-'?C,,'*N3 and arginine-'*C,,'“N> for 7 days. Cells were incubated 
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with 0.3 mg ml! iron oxide particles of 200-nm diameter (Ademtech) and oval- 
bumin-Alexa555 for 2.5 h, chased for 1.5h, then stimulated with (heavy labelled) 
or without (light labelled) LPS (30ng ml ) for 3 h. Cells were lysed in 10 mM HEPES- 
KOH, pH 7.4, containing 250 mM sucrose and, 1 mM EDTA (Total), then spun at 
120g. The post-nuclear supernatants (PNS) were applied to a MACS MS column 
on a magnet holder (Miltenyi). Endo-lysosomes were eluted from the column by 
separating the magnet. Heavy and light labelled endosomal proteins were mixed in 
a 1:1 ratio, resolved by SDS-PAGE, then treated with trypsin. Peptides were sepa- 
rated on a NanoAcquity UPLC (Waters), and analysed using an LTQ XL-Orbitrap 
mass spectrometer (ThermoFisher). Precursor ions were analysed in the FTMS ata 
resolution of 60,000 M/AM at m/z 400. Tandem mass spectral data was submitted 
for Mascot database (Matrix Science)**, then quantified using VistaQuant as previ- 
ously described”. 

Fluorescence microscopy. Microscopic images were taken using live cells unless 
specified. GFP- and RFP-fusion proteins were expressed in BMDC using retro- 
virus produced by pBMN-Z or pQCXIP and Phoenix-Eco or GP2-293. To label 
endo-lysosomes, cells were incubated with 20 pg ml’ of ovalbumin- or dextran- 
Alexa conjugate (Invitrogen), then chased with fresh medium for 30 min. BMDCs 
were stimulated with LPS (30ngml”') before microscopy. To measure MDP- 
rhodamine in endo-lysosomes, 293 cells expressing SLC15A3-GFP or LAMP1- 
GFP were incubated with MDP-rhodamine (1 pg ml’) overnight, then chased 
with fresh medium without MDP-rhodamine on Leica SP5. MDP-rhodamine in 
GFP-positive endosomes were quantified by ImageJ. 

MDP-uptake assay. COS7 cells (0.16 X 10°) were transfected with expression 
plasmid of either GFP, SLC15A3-GFP or the di-leucine mutants of SLC15A3-GFP, 
then cultured overnight. Cells were washed with transport buffer (5 mM p-glucose, 
140 mM NaCl, 1 mM MgSO,, 20 mM potassium phosphate, pH 7.4), then incu- 
bated with 10 pg ml” ' MDP-rhodamine in the buffer at pH 5.5 at 37 °C for 30 min. 
Cells were washed with ice-cold buffer, then lysed in 0.1% Triton X-100 and 0.1% 
NP-40. MDP-rhodamine was quantified by a fluorophotometre. 

Electron microscopy. The isolated endo-lysosome fraction was fixed in 2% glu- 
taraldehyde, post-fixed in 2% OsO, and en bloc stained in 1% uranyl acetate. For 
immunoelectron microscopy, cells were fixed with 4% paraformaldehyde, embedded 
in 12% gelatin, cryoprotected with 2.3 M sucrose, then frozen in liquid nitrogen. 
Ultrathin cryosections on copper grids were incubated with anti-tagRFP antibody 
followed by protein-A gold, as described earlier*’. The sections were then con- 
trasted with a 1.8% methylcellulose, 0.4% uranyl acetate mixture. 
Immunoprecipitation. 293 cells (3 X 10°) were transfected with empty or expres- 
sion plasmids of NOD2, SLC15A3, SLC15A4, or LAMP1, lysed in a buffer contain- 
ing 40 mM TrisCl (pH 7.4), 0.5% NP-40, 50 mM NaCl, and 5 mM MgCl, then applied 
to LMACS GFP Isolation Kit (Miltenyi) or EZview anti-Flag M2 affinity gel (Sigma) 
according to the manufacturer’s instruction. 

SLC15A3 knockout mouse. Mice carrying a floxed Slc15a3 were generated from 
an embryonic stem cell clone (Sic15a3°”"!/@““0MP)W) of Knockout Mouse Project 
Repository, then intercrossed with C57BL/6-Gt(ROSA)26Sor'™!(")4" Cre strain 
(Taconic) to produce knockout allele. Homozygous knockout mice are born at the 
Mendelian frequencies. 
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Extended Data Figure 1 | Phagocytosed Salmonella strongly activates 
NOD2. a, NOD2-transfected 293 cells were pre-incubated with or without 
cytochalasin D, then stimulated with soluble MDP. b, NOD2-transfected 293 
cells were stimulated with wild-type or invG mutant of S. typhimurium 
harbouring a vector or a plasmid carrying Yersinia invasin gene (pInvasin) as in 
Fig. 1. c, d, 293 cells expressing LAMP1-GFP were incubated with wild-type 
(c) or sifA mutant (d) of S. typhimurium expressing tagBFP for 1h, then 
cultured overnight in the presence of cultured for 24h in the presence of 

20 ugml~' gentamycin. The cells were observed under a fluorescence 
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microscope. Bars, 5 jim. e, NOD2- or vector-transfected 293 cells were 
stimulated with wild-type or sifA mutant of S. typhimurium as in Fig. 1. 

f, NOD2-transfected 293 cells were stimulated with wild-type or sifA mutant of 
S. typhimurium harbouring vector or a plasmid carrying SifA gene as in Fig. 1. 
g, NOD2- or vector-transfected 293 cells were stimulated with wild-type, 
sopE, sipA, sopE sipA or invG mutant of S. typhimurium. h, NOD2-transfected 
cells were incubated with rhodamine-conjugated MDP-beads, then uptake 
by cells was quantified by FACS. One representative result of three independent 
experiments is shown. Mean of triplicates + s.d. *P < 0.05 by t-test. 
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Extended Data Figure 2 | LAMP2 and MHCII are enriched in the endo- 
lysosome fraction of dendritic cells. a, b, Microscopic images of endo- 
lysosomes isolated from BMDCs stimulated without (a) or with (b) LPS. 
BMDCs were cultured with 200-nm diameter iron-oxide particles and 
ovalbumin (as an endocytic tracer), then incubated without (a) or with 
30ngml * LPS (b) for 3h. The endo-lysosomes were isolated using 
magnetic separation from the cells lysed in isotonic buffer. Endocytosed 
ovalbumin-Alexa555 (OVA), MHCII-GFP, and bright field (BF) are shown. 
When dendritic cells are stimulated with LPS, isolated endo-lysosomes 
contained numerous tubular structures (arrows). Bars, 5 jum. ¢, Total lysate, 
post-nuclear supernatant (PNS), flow through (unbound), and the fractions 
bound to column were blotted with antibodies against the marker proteins. 
Recoveries were calculated based on densitometric analysis using the Image] 


(NIH). LAMP2 and MHCII are enriched 49- and 19-fold, respectively, in 
fraction 2 with 2% recovery as compared to the cell homogenate. Na" /K* - 
ATPase (plasma membrane), transferrin receptor (early endosomes), 
syntaxin-6 (trans-Golgi), and actin (cytosol) are excluded (0.12-, 0.057-, and 
0.18-fold enrichment, respectively). d, SILAC mass spectrometry of 
endo-lysosomal proteins. Abundance ratios of LPS-stimulated endo-lysosomal 
proteins to unstimulated counterparts are plotted against their relative 
abundance. 1,436 proteins were identified in the endo-lysosome fractions, 

8 and 45 proteins of which were ATP-binding cassette (ABC) transporters 
and solute carrier family transporters, respectively (shown in red). Eight 
(ABCA1, ABCA3, TAP1, TAP2, SLC3A2, SLC7A2, SLC7A11 and SLC15A3) 
were upregulated >twofold by LPS treatment. 
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Extended Data Figure 3 | Localization of membrane transporters identified | SLC15A3 or SLC15A4 (arrow heads). Double arrows on cells expressing 


in the endo-lysosome fraction. a, Membrane transporters identified by SLC7A11, ABCC4 or ABCCI1 mark plasma membrane. b, SLC15A1 (mouse) or 
mass spectrometry were expressed as GFP fusion proteins in 293 cells, then SLC15A2 (human) were expressed as GFP fusion proteins in 293 cells, then 
observed by fluorescence microscopy. Endo-lysosomes are visualized with observed by fluorescence microscopy. Arrows mark plasma membrane. Bars, 
ovalbumin—Alexa555 (OVA) or dextran—Alexa555. Single arrows indicate 5 um. c, Gene expression of SLC15A1 and SLC15A2 were quantified by 


colocalization of membrane transporters labelled with endocytosed OVA or RT-PCR. Means = s.d. (+) denotes LPS-stimulated dendritic cells. 
dextran. Tubular endo-lysosomes were observed only in cells expressing 
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Extended Data Figure 4 | Two di-leucine motifs localize mouse SLC15A3 to 
endo-lysosomes. a, Di-leucine motifs in mammalian SLC15A3. Mouse and rat 
proteins have two di-leucine motifs (ExxxLL/V). b, Fluorescence microscopy 
of di-leucine mutants of mouse SLC15A3-GFP in 293 cells. Alanine 

substitution of either one in two motifs is insufficient to localize the protein to 


plasma membrane. Cell surfaces are marked by arrows. Bars, 5 [um. c, 293 cells 
transfected with an NF-«B-luciferase reporter, NOD2, and either di-leucine 
mutant of mouse SLC15A3(L19A V20A L288A L289A) or empty vector, were 
stimulated with soluble MDP for 7h. Means + s.d. **P < 0.01. 
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Extended Data Figure 5 | SLC15A3 and SLC15A4 specifically facilitate the 
signalling induced by MDP-beads. a, Raw 264.7 cells stably expressing 
SLC15A3 or SLC15A4 were stimulated with MDP (left) or LPS (right) for 24h. 
TNF-o in medium was quantified by ELISA. b, 293 cells transfected with an 
NF-«B-luciferase reporter, NOD2, and either SLC15A3, SLC15A4 or empty 
vector, were stimulated with S. typhimurium flagellin or homan TNF-« for 7 h. 
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c, SLC15A3-deficient or wild-type BMDCs were stimulated with MDP-beads. 
IL-1f in medium were quantified by ELISA. d, BMDCs were incubated 

with MDP-rhodamine-beads, then the uptake was quantified by FACS. 

e, SLC15A3-deficient or wild-type BMDCs were stimulated with LPS (left) or 
S. typhimurium (right) for 24h. IL-6 in medium was quantified by ELISA. 
Means + s.d. *P< 0.05 and **P< 0.01. 
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Extended Data Figure 6 | Endocytosed MDP are localized to endo- 
lysosomal tubules tracked along microtubules. a, 293 cells expressing 
SLC15A3-GFP were incubated with MDP-rhodamine for 6 h. After replacing 
with fresh medium without MDP-rhodamine, cells were observed by 
fluorescence microscopy. b, BMDCs that retrovirally express RFP-tubulin, 
were incubated with OVA-Alexa488 for 1h, stimulated with LPS (30 ng ml — 1) 


for 3h, then observed by fluorescence microscopy. Arrows indicate the 
endo-lysosomes extending along microtubules. c, MHC-II knock-in BMDCs 
that retrovirally express SLC15A3-GFP, were incubated with OVA-Alexa647 
for 1h, stimulated with LPS (30 ng ml’) for 3 h, then observed by fluorescence 
microscopy. Bars, 5 im. 
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Extended Data Figure 7 | NOD2 associates to Salmonella-containing 
phagosomes. a, 293 cells stably expressing RFP-NOD2 and LAMP1-GFP 
were incubated with S. typhimurium expressing tagBFP as in Fig. 4a. b, 293 cells 
expressing RFP-NOD1 were incubated with S. typhimurium expressing EGFP 
as in Fig. 4a, then observed by fluorescence microscopy. ¢, Left, 293 cells 
expressing RFP-NOD2 (FL, full length) or RFP-NOD2-3020insC were 
incubated with magnetic beads (1 tm), lysed in isotonic buffer, then the 
phagosomes were isolated using a magnetic column. Right, Phagosomes were 
isolated from BMDC expressing Flag-NOD2. RFP-NOD2, Flag-NOD2, 
LAMP1, LAMP2, and GAPDH in the phagosome fractions and total lysates 
were detected by immunoblotting of duplicated cell culture. NOD2 proteins in 
phagosomes were calculated by measuring the intensities of protein bands 
using ImageJ with following formula (1). GAPDH was used to estimate 
contamination of cytosol in the phagosome fraction. Approximately 20% of 
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NOD2 and 4% of NOD2-3020insC proteins are co-fractionated with LAMP1 
or LAMP2 in the phagosome fractions. ([NOD2] fraction/ [NOD2) total lysate — 
[GAPDH] gaction/ [GAPDH] total tysate)/(T[LAMP1 J raction/ [LAMP1 total lysate) (1). 
d, BMDCs retrovirally expressing Flag~-NOD2 were incubated with 
MDP-beads for 24h, fixed with 4% formaldehyde, permeabilized with 0.05% 
saponin, then stained with anti-Flag and anti-LAMP2 antibodies, followed 
by secondary antibodies conjugated with Alexa-dyes. The cells were observed 
under fluorescence microscopy. MDP-beads in the merged image are shown 
in blue. BF, bright field. e, 293 cells transfected with an NF-«B-luciferase 
reporter and either NOD2, RFP-NOD2, or RFP-NOD2-3020insC were 
incubated with Salmonella as in Fig. 1. The activation of NF-«B was quantified 
by the luciferase activity. f, Fluorescence microscopy of 293 cells expressing 
GFP-RIPK2. Endo-lysosomes are labelled with dextran-Alexa647. Bars, 5 jum. 
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Signal amplification and transduction in 
phytochrome photosensors 


Heikki Takala’**, Alexander Bjorling*, Oskar Berntsson?, Heli Lehtivuori!, Stephan Niebling’, Maria Hoernke’, Irina Kosheleva’, 
Robert Henning’, Andreas Menzel’, Janne A. Thalainen! & Sebastian Westenhoff? 


Sensory proteins must relay structural signals from the sensory site 
over large distances to regulatory output domains. Phytochromes 
are a major family of red-light-sensing kinases that control diverse 
cellular functions in plants, bacteria and fungi’. Bacterial phyto- 
chromes consist of a photosensory core and a carboxy-terminal reg- 
ulatory domain". Structures of photosensory cores are reported in 
the resting state’” * and conformational responses to light activation 
have been proposed in the vicinity of the chromophore” **. However, 
the structure of the signalling state and the mechanism of downstream 
signal relay through the photosensory core remain elusive. Here we 
report crystal and solution structures of the resting and activated states 
of the photosensory core of the bacteriophytochrome from Deinococcus 
radiodurans. The structures show an open and closed form of the 
dimeric protein for the activated and resting states, respectively. This 
nanometre-scale rearrangement is controlled by refolding of an evo- 
lutionarily conserved ‘tongue’, which is in contact with the chromo- 
phore. The findings reveal an unusual mechanism in which atomic-scale 
conformational changes around the chromophore are first ampli- 
fied into an angstrom-scale distance change in the tongue, and further 
grow into a nanometre-scale conformational signal. The structural 
mechanism is a blueprint for understanding how phytochromes con- 
nect to the cellular signalling network. 

The most common domain architecture of the photosensory core 
of phytochromes in bacteria, plants and fungi is PAS-GAF-PHY (Per/ 
Arndt/Sim-cGMP phosphodiesterase/adenyl cyclase/FhlA-phytochrome 
specific)’®. We investigate the PAS-GAF-PHY fragment of the bacter- 
iophytochrome from D. radiodurans. Figure 1a shows that this protein 
fragment shares the prototypical response of phytochromes to red/far- 
red illumination. Its biliverdin chromophore relaxes into the Pr (red- 
absorbing) state when kept in the dark or after exposure to far-red light. 
The majority of molecules transform into the Pfr (far-red-absorbing) 
state after red-light exposure. The dark relaxation from Pfr to Pr occurs 
in a matter of days, and the optical response of PAS-GAF-PHY is very 
similar to the full-length protein”. For the PHY-less PAS-GAF con- 
struct, the absorption spectra (Fig. 1a) and dark reversion within min- 
utes indicate an incomplete photocycle™. 

Time-resolved solution X-ray scattering reports on structural changes 
in protein reactions”*”°. We used it here to characterize the structural 
dynamics of the phytochrome photocycle. In a first experiment, the phy- 
tochrome fragments were repeatedly switched between the Pr and Pfr 
states using laser flashes of 10-ms duration at 671 nm (Pr — Pfr) and 
750 nm (Pfr— Pr). The X-ray solution scattering was recorded in bet- 
ween the laser flashes for approximately 1 s (see Supplementary Infor- 
mation for details). The light-induced change in scattering, AS(q), encodes 
the structural change of the protein during the Pr— Pfr transition. For 
the complete photosensory core (PAS-GAF-PHY) we find a large oscil- 
lating difference scattering signal at low angles, q < 2 nm ' (Fig. 1b, 
red line), indicating large structural changes on the nanometre scale. 
Oscillations at higher q, which report on structural changes on smaller 


length scales, were also observed (Extended Data Fig. le), but are not 
discussed further here. The laser-induced AS of the PAS-GAF-PHY con- 
struct was reproduced by a standard small-angle X-ray scattering (SAXS) 
experiment with pre-illuminated samples (Fig. 1b, blue line). Time- 
resolved difference X-ray scattering data covering the micro- and milli- 
second time scales also reproduce these features when photoconversion 
is complete (delay time, 30 ms; Fig. 1b, black line). These data further 
establish that the discussed low-q features grow in with a time-constant 
of 4.3 ms (Fig. 1b and Extended Data Fig. 1d), which coincides with the 
formation of the Pfr state as measured by optical absorption of the 
chromophore (1.0 ms, Extended Data Fig. 1d). 

When performing the same experiments on the PHY-less chromophore- 
binding (PAS-GAF), the oscillatory signal at low q is absent (Fig. 1c). 
The observed oscillations in the PAS-GAF domains are much smaller 
in amplitude and confined to higher q ranges (Fig. 1c and Extended Data 
Fig. le). These data show that quaternary structural changes occur only 
when the PHY domain is present. In the wild-type protein the PHY 
domain connects the chromophore binding domains PAS-GAF with 
the output kinase, making it likely that the detected rearrangement is 
functionally relevant. 

To examine the nature of these conformational changes and to find 
out how they arise, we crystallized the photosensory core in the dark 
(referred to as dark) and under periodic illumination at 655 nm (referred 
to as illuminated), which transforms a large fraction of the protein mol- 
ecules into the Pfr state (see Fig. la and Extended Data figures for details). 
Absorption spectra of the dark crystals indicate the Pr state, whereas 
the illuminated crystals are Pfr-enriched (Extended Data Fig. 3c). Omit 
maps indicate a change in the conformation of the chromophore bet- 
ween the two crystal forms (Extended Data Fig. 3b). Although the bil- 
iverdin conformation cannot be fit unambiguously to the electron density 
in the illuminated structure, the electron density of the rest of the pro- 
tein is homogeneous (Extended Data Fig. 2b). 

Our dark crystal structure was modelled from data up to 3.80 A res- 
olution (Fig. 2a). The observed domain arrangement is similar to pre- 
viously reported structures of bacterial PAS-GAF and PAS-GAF-PHY 
protein fragments'*"!>””, but the dimer arrangement differs from some 
of these structures’*’”. The illuminated structure, modelled against data 
cut at 3.24 A, shows marked differences compared to the dark structure 
(Fig. 2a). First, the dimer adopts an open Y-like shape of the PHY do- 
mains. This is in contrast to all reported resting state structures, where 
the dimer is closed (Fig. 2a)'*’*"7”°. Second, the tongue of the PHY do- 
mains (residues 446-477) appears as an o-helix and a loop in the illu- 
minated model and is a B-sheet in the dark model (Fig. 2b and Extended 
Data Fig. 2). These two folds have been observed separately in non- 
canonical bacterial'*”° and prototypical cyanobacterial phytochromes’*”, 
respectively (Extended Data Fig. 4). Our structures establish that the 
refolding of the tongue occurs within the same bacterial phytochrome, 
and suggest that this refolding is associated with opening of the dimer. 
The change in fold causes the length of the tongue to vary between the 
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Figure 1 | Time-resolved solution X-ray scattering of the PAS-GAF and 
PAS-GAF-PHY fragments from D. radiodurans. a, Absorption (A) spectra of 
protein fragments after illumination with far-red (780 nm) and red (655 nm) 
light, labelled Pr and Pfr respectively. AU, arbitrary units. b, c, Solution X-ray 
scattering data from the PAS-GAF-PHY and PAS-GAF fragments shown on 
the same scale. Time-resolved data (black, BioCARS), direct static data 
collected by laser-induced population switching (red, cSAXS) and indirect 
static data from a standard SAXS experiment with pre-illuminated samples 
(blue, BM29) is shown. BioCARS, cSAXS and BM29 refer to synchrotron 
endstations (see Methods). AS is the difference in scattered X-ray intensity 
caused by illumination at 671 nm. q = 4n/2 X sin@ at wavelength / and 
scattering angle 20. 


dark and illuminated structures (Fig. 2b), which is key for understand- 
ing the role of the tongue in signal relay. 

Before detailing this mechanism, it is important to test how well the 
crystal models represent the solution structures of the Pr and Pfr states. 
To this end, we performed structural refinement against the solution dif- 
ference X-ray scattering data (Fig. 1), which is a sensitive indicator of con- 
formational change. It has the advantage over absolute scattering profiles 
that experimental error and uncertainties arising from the choice of method 
for scattering calculations largely cancel out (Extended Data Fig. 5c). 
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Figure 2 | Dark and illuminated crystal structures of the PAS-GAF-PHY 
fragment from D. radiodurans. a, Crystal structures of the PAS-GAF-PHY 
dimer in the dark and illuminated forms. The tongue of the PHY domain 
(green) changes fold and the dimer opens up in the illuminated state. The 
biliverdin chromophore is shown in orange. b, Fold and interactions of the 
PHY tongue. The B-sheet (dark) coordinates to Asp 207 and Tyr 263 via 
Arg 466, whereas the «-helix (illuminated) coordinates via Ser 468, both of 
which are part of the conserved “*°P-R-X-S-F*® motif. The named residues are 
shown as sticks. The B-sheet (dark) is further held by hydrogen-bonding 
interactions between the amide groups of Ala 450, Gly 452 and Arg 202. The 
change in PHY domain conformation leads to a shortening of the tongue by 
2.5 A as measured between GAF (Arg 202) and PHY (Tyr 479) domains 
(arrows). A backbone interaction close to the PHY domain between Leu 445 
and Tyr 479, shared by both crystal structures, is also indicated. The green 
dashed lines indicate regions that are not modelled or not shown for clarity. 


We generated candidate solution structures by running two sets of 
molecular dynamics simulations starting from the dark and illuminated 
crystal structures (see Supplementary Information for details). These 
are referred to as Pr and Pfr trajectories as they aim to model solution 
scattering data representing these states. Snapshots were recorded every 
50 ps and the solution X-ray scattering was calculated from each”’. All 
pairs of Pr/Pfr candidate structures were then compared to the static 
difference X-ray scattering (Fig. 1b and Extended Data Fig. 7, see Sup- 
plementary Information for details). We selected the 100 Pr/Pfr pairs 
with best agreement to the data and consider the participating struc- 
tures to be representative of the protein in solution. Considering all 
pairwise differences between these structures, a consistent set of 747 curves 
was generated and evaluated (Fig. 3a). The agreement of the model and 
experiment is excellent, and considerable improvement is made com- 
pared to the crystal structures. 

To verify the choice of solution structures on the basis of difference scat- 
tering, these structures were cross-validated against absolute, population- 
corrected SAXS data as described in Supplementary Information (see 
also Extended Data Fig. 5). Figure 3b shows that the agreement with 
absolute SAXS data is strongly correlated with the separation of the 
PHY domains, and that the solution structures, proposed purely on 
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Figure 3 | Refinement of solution structures against difference SAXS data. 
a, Calculated differences X-ray scattering between the proposed solution 
structures (Pfr — Pr, grey) agree with experimental data (black). As a safeguard 
against overfitting, the grey curves show all 747 differences between solution 
structures, not just the 100 best curves on which the pairs were selected (see text 
for details). The agreement with experiment is considerably improved for 

the solution structures, compared to the difference scattering calculated from 
the two crystal forms (green). b, Validation of the obtained solution structures 
(squares) against absolute X-ray scattering. The vertical axis shows the 
sum-squared error of each structure (SSE,p;) compared to the total sum of 
squares about the mean (SST). All molecular dynamics snapshots (dots) and 
crystal structures (diamonds) are scored against absolute SAXS data (BM29) 
after correction for the mixed Pr/Pfr populations (see Extended Data Fig. 5d). 
The two shades of red and blue correspond to different simulation conditions 
as detailed in Supplementary Information. The PHY domain separation 

is measured as the distance between the centres of mass of the two 
C-terminal helices (residues 484-503). 


difference scattering, cluster in the low-error parts of these correla- 
tions. The agreement of the solution structures to the solution scatter- 
ing is considerably improved compared to the crystal structures as can 
be seen in Fig. 3b. Representative structural models for Pr and Pfr states 
in solution are depicted in Fig. 4. 

The solution-structural models show that the PHY domains of oppos- 
ing monomers come fairly close to each other in the Pr state but move 
apart by ~3 nm during the Pr — Pfr transition (Fig. 4). The Y shape 
assumed by the solution structures in the Pfr state is in qualitative 
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Figure 4 | Proposed solution structures of the bacterial phytochrome from 
D. radiodurans. a, b, Representative solution structures for the Pr and Pfr 
states of the photosensory core, identified from solution X-ray scattering 
experiments. Nine Pfr and ten Pr structures are presented viewed along (a) and 
perpendicular (b) to the dimer symmetry axis. The long scaffolding helix is 
highlighted in blue, the PHY tongue in green, and the biliverdin chromophore 
in orange. The PHY domain separation differs by about 3 nm between the 

Pr and Pfr structures as shown in Fig. 3b. The hinge region at the scaffolding 
helix (residues Val 318 and Lys 319) in Pfr is indicated with black arrowheads. 


agreement with a low-resolution envelope of a Rhodopseudomonas 
palustris phytochrome determined by SAXS”’. Our crystal structures 
capture the essence of this quaternary conformational change, albeit 
with differences in the amplitude and details of the motion. These dif- 
ferences are probably caused by crystal contacts (Extended Data Fig. 6). 
Nevertheless, the overall agreement with the solution structures lends 
strong support to the idea that the refolding of the tongue observed in 
the crystal structures is intrinsic to the structural rearrangement bet- 
ween Pr and Pfr states. 

From the crystal and solution structures the following mechanism for 
signal transduction through the photosensory core emerges. It is known 
that the rotation of the biliverdin D-ring causes atomic rearrangements 
in the chromophore binding pocket, including displacement of Asp 207 
and Tyr 263 (refs 19-22). Our crystal structures suggest that this con- 
trols the fold of the tongue of the PHY domain. Importantly, the tongue 
is shortened as a result of the refolding and the distance between the 
GAF and PHY domain is reduced by 2.5 A (Fig. 2b) upon red-light illu- 
mination. As a consequence of this, and demonstrated by our Pr and 
Pfr solution structures, the dimer opens up between the PHY domains 
by several nanometres (Fig. 4). 

Three factors are essential for the proposed structural mechanism. 
First, the PAS, GAF and PHY domains, which are connected by the 
tongue, are known to be very rigid internally such that they cannot de- 
form to absorb the length variation of the tongue’’. Second, the tongue 
and its junctions to PHY and GAF have to be rigid, especially in Pr, in 
which the PHY and GAF domains are pushed away from each other. In 
our dark structures the direction of the tongue is rigidly fixed with respect 
to the GAF domains by the conserved interaction Arg 466 to Asp 207 and 
Tyr 263 and additional backbone hydrogen bonding (Fig. 2b)'’. In the 
illuminated structure the tongue binds to GAF through interaction of 
Ser 468 with Asp 207 and Tyr 263 (ref. 12), and through hydrophobic 
interactions, among others by Tyr 472 (Extended Data Fig. 4b)'’. The 
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tongue is also locked close to the PHY domain by additional backbone 
hydrogen bonding. Third, the PAS-GAF and PHY domains are connected 
by the unusually long scaffolding helix, which is necessary to redirect 
the shortening of the tongue into bending of the monomer. Indeed, our 
simulations reveal a hinge in the scaffolding helix (Fig. 4b). All phyto- 
chrome structures published to date support these three requirements’. 

In bacterial and fungal phytochromes, signal output is through C- 
terminal histidine kinase domains that autophosphorylate and further 
phosphotransfer to a response regulator’”. In plant phytochromes, two 
additional PAS domains are included in the C-terminal regulatory region 
and a more complex pattern of functions has to be controlled, such as 
serine/threonine kinase activity’, and affinity to interaction partners””®. 
In all cases, the output activity is probably controlled by a structural 
change in the photosensory core. The photosensory core and the key 
amino acid sequence in the tongue region “*°P-R-X-S-F*™ (where X 
denotes any residue) are highly conserved over the whole phytochrome 
superfamily (Extended Data Fig. 2c). Here we suggest a mechanism in 
which the fold of the PHY-tongue controls the bending of the mono- 
mer. For the isolated photosensory unit in this study, this results in 
marked opening of the dimer in Pfr (Figs 2 and 4). In full-length phy- 
tochromes it becomes conceivable that the conformations of output 
domains are modified. Alternatively, previously buried patches of the 
protein could become accessible to interaction partners, or monomers 
could rearrange with respect to each other. It will be intriguing to see 
the emergence of these mechanisms, which become deductible with ex- 
perimental approaches similar to the one presented here. 


METHODS SUMMARY 


The PAS-GAF-PHY and PAS-GAF fragments from D. radiodurans were expressed 
in the Escherichia coli strain BL21 (DE3) and purified by affinity and size-exclusion 
chromatography. Crystallographic data was collected at beamline ID23-1 of the 
European Synchrotron Radiation Facility (ESRF) (see Extended Data Table 1). Time- 
resolved X-ray scattering with millisecond time resolution were recorded at beam- 
line cSAXS of the Swiss Light Source**. SAXS measurements were performed at 
beamline BM29 of the ESRF and analysed as summarized in Extended Data Table 2a. 
Time-resolved X-ray scattering data in the micro- and millisecond ranges were 
collected at beamline ID-14-B, BioCARS, of the Advanced Photon Source at Argonne 
National Laboratory. All solution-scattering sample details are summarized in Ex- 
tended Data Table 2b. Molecular dynamics simulations (GROMACS 4.5.5)” were 
used to generate trial solution structures and theoretical scattering curves were eval- 
uated using Zernike expansion as implemented in SASTBX”®. The structures were 
scored against experimental scattering data as detailed in Supplementary Information. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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Extended Data Figure 1 | Difference scattering and kinetics. a-c, Singular 
value decomposition (AS (q, t) = US V", where U, Sand Vare matrix factors) of 
time-resolved solution scattering data from the PAS-GAF-PHY fragment. Two 
components () suffice to describe the data, the final product (n = 1) anda 
transient low-q depression (n = 2). a, The first three basis spectra (first, second 
and third columns of US), and original X-ray scattering data (black) with 
reconstruction based on the first two singular values (all columns of US V", red) 
are shown. b, Relative amplitudes of the two first basis spectra (first and second 
columns of V). Lines are guides for the eye. c, Singular values (diagonal 
elements of S). d, The rise of the Pfr product state as measured by direct 


integration of difference scattering curves ((AS(q)”) <q< 95) and by optical 
transient absorption at 754 nm after excitation with a 680 nm laser flash. Solid 
curve is the function (1 — e “’"), where t is time and and 7 a free fitting 
parameter. These data establish that the structural change occurs just after the 
Pfr state is formed in the chromophore. Note the positive signal in the 
absorption curve with very small amplitude at >3 ms, which appears to decay 
while the structural signal rises. This could be because the absorption 
properties depend weakly on the large-scale rearrangement. e, Direct static 
difference data from Fig. 1, amplified by q” to reveal wide-angle oscillations. 
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Extended Data Figure 2 | Light-induced changes in the secondary structure 
of the evolutionally conserved PHY tongue. a, Secondary structure and 
topology of the D. radiodurans PAS-GAF-PHY fragment. The structural 
elements in our crystal structures are very similar to other published 
phytochrome structures'*!’. The PHY tongue region (box), however, was 
found to refold upon illumination. The five-stranded B-sheet core of the GAF 
domain is extended by a small sixth B-strand (called 2') that interacts with the 
PHY tongue (see Fig. 2 and Extended Data Fig. 4). The mini-sheet structure at 
the knot region" is not included in the graph. The figure is adapted from ref. 12. 
b, Omit map of the PHY tongue in the dark crystal form (top panel) and the 
illuminated crystal form (bottom panel). In the dark crystal form, the omit map 
density (blue) supports the built B-turn secondary structure (orange sticks), 
even though most of the side chains are poorly resolved. In the illuminated 
crystal form, the omit map (blue) clearly reveals the density of a helix with its 
bulky side chains (orange). The omit maps were calculated by repeating 
molecular replacement and a refinement step (see Supplementary Information) 
with a structure in which the PHY tongue was removed. All electron density 
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maps are contoured at 3.0c. c, Sequence alignment of part of the GAF domain 
and of PHY loop region. The conserved *”’D-I-P”” motif in the GAF domain 
and “P-R-X-S-F*” motif in the PHY tongue are marked by asterisks. Five 
representatives from eubacterial (BphP), cyanobacterial (Cph), higher plant, 
fungi (Fph) and PAS-less phytochromes are shown. The full alignment is 
supplied as source data. Colour coding: dark grey, identical residues; light grey, 
similar residues. The amino acid sequences were aligned with Clustal Omega 
using the default settings*'. Sequences include Deinococcus radiodurans BphP, 
Pseudomonas syringae pv. tomato T1 BphP, Rhodopseudomonas palustris TIE- 
1 BphP3, Pseudomonas aeruginosa PAO1 BphP, Agrobacterium fabrum str. 
C58 BphP1 (Agp1), Synechocystis sp. PCC6803 Syn-Cph1, Microcystis 
aeruginosa NIES-843, Nodularia spumigena CCY9414, Cyanothece sp. PCC 
7822, Anabaena variabilis ATCC 29413, Physcomitrella patens Phy1, Zea mays 
PhyB1, Populus trichocarpa PhyA, Selaginella martensii Phy1, Arabidopsis 
thaliana PhyA, Synechococcus OSB’ SyB-Cph1, Synechococcus OSA SyA-Cph1, 
Nostoc punctiforme PCC73102, Lyngbya sp. PCC 8106 and Anabaena variabilis 
ATCC 29413. 
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Extended Data Figure 3 | Biliverdin structure and spectra in crystals. 

a, Photographs of the crystals under cryogenic conditions at the beamline 
1D23-1. b, Biliverdin omit maps of the dark (top left) and illuminated (top 
right) form support the existence of the modelled biliverdin conformations 
(yellow and orange). Comparison of the electron density around the biliverdin 
with published structures (bottom panels). In the dark form, the electron 
density indicates a conformation similar to the published Pr structures'®'**?**, 
including a D. radiodurans structure (PDB accession 209C, cyan)’. Therefore 
we modelled the biliverdin as a 15Za isomer. In the illuminated form, however, 
the electron density supports neither the biliverdin as determined in the Pfr 
structure of PaBphP (PDB 3NHQ, red)”®, nor as determined in the Pr structure 
of D. radiodurans (209C, cyan)’. Therefore the rotation of the biliverdin 
D-ring cannot be reliably determined and is modelled with both possibilities 
(15Za, and 15Ea, orange and yellow in top right panel). Omit maps were 
calculated as in Extended Data Fig. 2 and contoured at 3.0c. c, Representative 
absorption spectra of the dark (black) and illuminated (grey) crystals, recorded 
at 123 K. Note that the terms ‘illuminated’ and ‘dark’ refer here to the 
crystallization conditions (see Supplementary Information for details). The 
dark crystal spectrum resembles the Pr spectrum in solution (Fig. 1a). 
Illumination with red light in the crystallization drops at ambient temperature 


Illuminated form c 


Dark 


A (arb.) 


Illuminated 


600 700 


led to a slight increase of far-red absorption and disintegration of the crystals 
(data not shown). Dark crystals were unaffected by far-red illumination. The 
spectrum of the illuminated crystals shows that a substantial proportion 
(>50%) of the proteins reside in Pfr state. A similar absorption was detected 
from solubilized crystals. The illuminated crystals could be switched to Pr-like 
absorption with far-red illumination. Reversely, the Pfr-like features could also 
be increased with red light (data not shown) with illumination at ambient 
temperature. Exposure with light increased the scattering background in the 
absorption measurements. The crystals seemed unaffected by the illumination 
when illuminated with red light in the crystallization drops. Although the 
spectral analyses of the illuminated crystals do not indicate a pure Pfr spectrum, 
and the biliverdin conformation cannot be fit unambiguously to the electron 
density, the remainder of the electron density is homogeneous (Extended 
Data Fig. 2b). Most importantly, the tongue region of the PHY domain 
adopts the conformation resembling the Pfr state of PaBphP (Extended Data 
Fig. 4b)'*"°?°. The conformations of the four monomers in an asymmetric 
unit are practically identical and hence we conclude that biliverdin can co-exist 
in both Pr and Pfr states inside this crystal form and still the protein 

part represents the structural aspects of the Pfr state only. 
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Extended Data Figure 4 | Comparison of PHY tongue interactions with 
published structures. a, Comparison of the dark crystal form (green/dark 
grey) to cyanobacterial cph1 in Pr state (PDB 2VEA, orange/light grey)’’. 

b, Comparison of the illuminated crystal form (green/dark grey) to PaBphP in 
Pfr state (PDB code 3NHQ”, orange/light grey). In both the Pr and Pfr forms, 
key interactions are conserved between the phytochromes (black dashes), as 
well as the positions of three conserved tongue motives (see Extended Data 
Fig. 2c). The residues of these three motives are indicated as: *°'W-(G/A)-G**, 
469p_R-X-S-F*® and 472(W/E,Y)-X-E*4, with numbering from the 

D. radiodurans sequence. Trp 451 was not modelled in our illuminated crystal 
structure, and part of the PHY tongue has been removed for clarity. Small 
changes in relative orientations between the difference crystal structures are 
observed, for example, a slight tilt of helix of the Pfr tongue. 
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Extended Data Figure 5 | SAXS data and calculations. a, Experimental SAXS 
data of dark (‘Pr’) and pre-illuminated (‘Pfr’) samples. Here, I is scattered 
intensity, 20 is the scattering angle and q is the magnitude of the scattering 
vector. The data are merged from the concentration series (Extended Data 
Table 2b) and are normalized on 0.4 nm i< q<0.6 nm |‘. b, Guinier plot of 
the low-q region, shown for all concentrations. The curves converge at low 
concentration (C). Inset shows the radii of gyration (Rg) calculated from the 
curves in a according to the Guinier approximation. Dotted lines show the 
q-range used for linear regression (0.15 nm’ 7 <q’ <0.120nm *), which is 
such that qR, < 1.3 as required**. Arrows indicate increasing concentration. 
c, Average difference scattering signals calculated from the solution-structural 
models using three methods: Crysol (default settings), SASTBX with spherical 
harmonic expansion (SHE, default settings), and SASTBX with Zernike 


100-v (% Pfr) 


polynomial expansion as described in Supplementary Information”. Different 
calculation methods result in very similar predicted difference X-ray scattering 
patterns. d, Determination of the relative Pr/Pfr populations represented by the 
BM29 data as described in Supplementary Information. SSE,,, is the sum- 
squared error against absolute scattering data, SST is the sum of squares about 
the mean, and v is the Pfr fraction. We find that our Pr sample contained only 
Pr (top) whereas in the Pfr sample 64% of the protein molecules adopted the Pfr 
conformation (bottom). Notably, the static difference scattering patterns 
(Fig. 1) represent, up to a scaling factor, the relation between pure Pr and Pfr 
populations. This is in contrast to traditional SAXS which report on population 
mixtures, because the Pfr state cannot be easily produced with 100% population 
in solution. 
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Extended Data Figure 6 | Packing interactions of the crystal forms. 

a, b, Crystal packing interactions of the dark (a) and illuminated (b) crystal 
forms. The dimer of an asymmetric unit is shown in red and the symmetry 
mates in grey. For the illuminated form, only chains A and B are shown in red. 
Interactions are shown from two orientations for clarity. In the dark form, 
crystal contacts are seen in the top regions of the PHY domains and therefore 
may cause artefacts in the long scaffolding helix and in the opening of the PHY 
domains. In the illuminated form contacts are such that the PHY domains may 
be pushed closer together, which is consistent with the larger separation of the 


b Illuminated form 


LETTER 


PHY domains as refined from the solution X-ray scattering data. It is 
noteworthy that the relative orientation of the monomers in the dimer is 
different between all three known structures for PAS-GAF-PHY 
phytochromes. For P. aeruginosa the dimer is parallel with variations in 
between different copies of the dimer in the crystallographic unit cell’’, in two 
cyanobacterial phytochrome an antiparallel dimer is observed’***, and in our Pr 
structure, the monomer have an angle of approximately 45°. The rings 
highlight crystal contacts. 
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Extended Data Figure 7 | Solution-structural refinement. a, The to reject all molecular dynamics frames Rpp > Recut. The resulting average over 
distribution of PHY domain separations (Rpp) obtained from unbiased Rpp of the best N = 100 pairs is plotted as a function of R.. It is found that 
molecular dynamics simulations (production runs 1-3). The best-matching Rpp ~ Rou» which indicates that the best fit to the difference X-ray scattering 
sets A and B of solution structures found from these conformations are data are always at the highest separations available in sampling range. d and 


indicated by crosses (with N = 100 best pairs chosen from M ~ 6 X 10° possible _ e show the dependence of the total and average error (SSE,,¢ and SSE,,) as 
combinations). Whereas the Pr structures cluster in a region of high sampling, functions of R.y,. SST represents the overall sum of square about the mean. 


the Pfr structures lie at the edges of the PHY-PHY distribution, suggesting It is observed that the error decreases steeply for Rey, <5 nm, and only 
inadequate sampling. To remedy this, we artificially scanned the PHY domain _ marginally for Ra >5nm. We therefore consider optimization in the latter 
separation in separate simulations (production runs 4 and 5) to improve range overfitting, and applied Rout = 5.0 nm in the refinement for the solution 
sampling. b, The new distribution of pulled PHY domain separations in the Pfr structures. The refined structures (Fig. 4) should therefore be considered to 
state. The final analysis and all solution-structural conclusions drawn in this represent lower limits on Rpp. f, Solution-structural refinement parameters 


study are based on the trajectories described in b. c-e, Consistency test of the and results. 
structural refinement procedure. c, A cutoff parameter R.,, was introduced 
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Extended Data Table 1 | Crystallographic data collection and refinement statistics 


Dark form* 


Illuminated form* 


Data collection 
Space group 
Cell dimensions 

a, b, c (A) 

a, B, y (°) 
Resolution (A) 
Rmerge (%) 

Ilo 
Completeness (%) 


Redundancy 


Refinement 

Resolution (A) 

No. reflections 

Rwork ! Piree 

No. atoms 
Protein 
Ligand/ion 
Water 

Average B-factor (A”) 
Protein 
Ligand/ion 
Water 

Wilson B-factor (A”) 

R.m.s deviations 
Bond lengths (A) 
Bond angles (°) 


P6, 


150.28,150.28, 145.95 
90.0, 90.0, 120.0 
48.65-3.80 (3.90-3.80)" 
39.2 (451.3) 

4.72 (1.05) 

99.9 (100.0) 

11.0 (11.6) 


48.65-3.80 (3.90-3.80) 
17597 (1295) 
25.3/26.6 (37.10/40.20) 


7416 
86 

0) 
210.24 
208.71 
170.21 


130.97 


0.006 
1.112 


P21212; 


86.90, 195.70, 225.00 
90.0, 90.0, 90.0 
49.22-3.24 (3.32-3.24)t 
24.1 (439.0) 

9.69 (0.83) 

99.9 (100.0) 

19.3 (20.3) 


49.22-3.24 (3.32-3.24) 
58759 (4255) 
23.3/26.0 (39.7/39.7) 


14784 
172 

0 
158.61 
156.92 
143.48 


118.65 


0.005 
1.159 


*Data from two crystals were used. 
tHighest resolution shell is shown in parenthesis. 
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Extended Data Table 2 | SAXS statistics and sample details 


A 
PAS-GAF-PHY Bovine serum albumin 
Dark Illuminated (BSA) 
Rg (nm) 3.75 + 0.02 3.78 + 0.02 3.10 + 0.01 
Ry (nm)* 3.72 3.77 
Dmax (nM) 11.5 12.4 
Veoroa (nm*) 173 165 
ly 97.3 + 0.2 86.8 + 0.3 64.3 + 0.03 
MWexp (kDa) ~100 ~90 ~66 
MWeate (kDa) 113.0 113.0 


*from manual analysis in Extended Data Figure 5 


B 

Sample Conc. [mg/ml] Buffer Salt pH 
cSAXS 

PAS-GAF 18 20 mM Tris 75 mM NaCl 7.8 
PAS-GAF-PHY 25 20 mM Tris 150 mM NaCl 7.0 
BioCARS 

PAS-GAF 28 30 mM Tris 8.0 
PAS-GAF-PHY 25 30 mM Tris 8.0 
BM29 

PAS-GAF 0.5-26* 30 mM Tris 8.0 
PAS-GAF-PHY 0.5-29* 20 mM Tris 150 mM NaCl 7.0 


*see Extended Data Figure 5 


a, Statistics of the static SAXS (BM29) data, including radii of gyration (Rg), maximum particle dimension 
(Dmax), Porod volume (Vporoa); forward scattering (/o). The molecular weights (MWexp) were estimated 
from the Io of BSA using the formula MWex, ~ MW(BSA)*[/o(sample)//o(BSA)], where 

MW(BSA) = 66 kDa. Theoretical molecular weights (MW<ai-) were calculated from the protein 
sequence*®. b, Details of the samples used for solution scattering measurements. 
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CAREERS 


TURNING POINT Neuroscientist reaches 
milestone in fruit-fly brain map p.251 


| WORK-LIFE BALANCE Creativity breaks can 
pay job dividends, study finds p.251 


NATUREJOBS For the latest career 
listings and advice www.naturejobs.com 


CONTINUING EDUCATION 


Study broad and deep 


Short tutorials can help researchers to sharpen skills, and longer courses can provide 
away to deepen knowledge. 


BY SARAH WEBB 


successful research career demands 
A scientists engage in non-stop 
learning — after all, technologies 
advance, interests shift and discoveries trans- 
form understanding, especially in today’s era 
of ‘big data. Training sessions such as work- 
shops, courses or online tutorials can fill the 
gaps in a scientist’s knowledge and skills, 
helping them to improve and expand their 
research programme and support their efforts 
to land a job. 
To derive maximum benefit from off-site 
training, researchers need to identify exactly 


where the gap is, then search online or talk to 
colleagues to find a workshop or course that 
matches their needs — scientific societies and 
conferences are often good sources — and 
manage logistics, including cost and timing 
(see ‘The right fit’). They may want to apply for 
grants or fellowships from societies or other 
professional organizations to cover tuition and 
off-site travel expenses, and allocate the time 
needed to do so. They may have to calculate 
how to juggle their resources if they are unable 
to get funding or if their employer cannot pro- 
vide support. Some courses are offered only 
once a year and may fill up quickly, so prospec- 
tive students need to make themselves aware of 
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enrolment dates and be ready to pull the trigger. 

When does a short course or workshop 
make sense? Researchers often turn to train- 
ing manuals or online tutorials when they 
need to pick up a new laboratory technique or 
learn how to use equipment or software. But 
sometimes hands-on, in-person instruction is 
more effective than reading, especially when 
it comes to mastering complex tasks such as 
troubleshooting instrument problems or man- 
aging data analysis. 

Are the costs worth it? Tuition might cost 
anything from a few hundred dollars for a day- 
long workshop to US$1,500 for a week-long 
course — and that would not include travel > 
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> expenses. (Some day-long workshops and 
courses are scheduled directly before or after 
major scientific meetings, allowing researchers 
to save on travel costs.) 

Furthermore, if the researcher is lucky, the 
right offering will be close by. Saik-Kia Goh, 
a bioengineering PhD student at the Uni- 
versity of Pittsburgh in Pennsylvania, took a 
free two-week course in clinical and transla- 
tional research at the US National Institutes 
of Health (NIH) in Bethesda, Maryland, an 
easily drivable distance of 370 kilometres. 
Accommodation costs came to about $2,000, 
half of which he covered himself — but he 
not only gained a greater understanding 
of the clinical and regulatory side of stem- 
cell research, he was also able to discuss his 
research with NIH scientists and learn about 
postdoctoral opportunities he had not been 
aware of. “I wasn't expecting it to be as much 
of a networking opportunity,” he says. “It was 
an investment in my career.’ 

Sometimes, however, researchers need 
to travel halfway around the world. Kirstin 
Wurms, a plant pathologist at Plant & Food 
Research, a government institute in Hamilton, 
New Zealand, examines mechanisms of plant 
resistance to disease. The field increasingly 
relies on gene-expression data from a tech- 
nique known as the quantitative polymerase 
chain reaction (qPCR), which requires spe- 
cialized expertise to produce reliable, repro- 
ducible data. Wurms, who did not have that 
know-how, initially relied on a colleague's 
assistance. But when that colleague left the 
institute, she knew that she needed to learn the 
technique herself. She signed up for two short 
courses offered last September at TATAA 
Biocenter, the world’s largest qPCR training 
centre and service 
provider, in Gothen- 
burg, Sweden. Sup- 
ported by a fellowship 
that helps to pay for 
overseas travel for 
scientists, plus some 
funds from her insti- 
tute, Wurms learned 
the technique, its 
international guide- 
lines and how to use 


4 


a statistical program “It was an 

to analyse data. investment , 
International Inmy Career. 

travel was also on = Saik-Kia Goh 


the cards for Stefan 

Suter, a wildlife-management researcher at 
the Zurich University of Applied Sciences in 
Switzerland. He needed to learn how to ana- 
lyse communication between wild boars, and 
found that the Cornell Lab of Ornithology in 
Ithaca, New York, offered a week-long course 
on working with sound-analysis software. 
Suter had used a simplified version of the soft- 
ware during his doctoral research, but needed 
to go deeper to realize his plan of protecting 
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THE RIGHT FIT 


How to find the ideal course 


Trying to figure out how to fill a knowledge 
gap can be overwhelming. Jacquelyn Gill, a 
palaeoecologist at the University of Maine in 
Orono, has advice on how to tackle the issue. 
@ Look around online and talk to colleagues 
and mentors about the skills you would like 
to learn and how you might bridge the gap. 
@ Know your learning style. If you work well 
independently, you might be able to learn 
about a statistical model or a programming 
language simply by working through a 
textbook. If you need group accountability, 
you might be able to work with a group of 
other graduate students or postdocs who 


agricultural fields. The course helped him 
to create warning calls that scare boars away 
from valuable crops. Suter received full 
financial support from his university for the 
US trip because he made the case that taking 
the course would also allow him to improve 
his English. His previous experience work- 
ing with the software meant that Suter could 
focus on specific questions and maximize 
the opportunity to get answers from experts. 
Although the course had a basic curriculum, 
“there was a lot of space for individual ques- 
tions, and an interactive approach, which I 
really appreciated’, he says. 


UNEXPECTED BENEFITS 

Short courses can be effective for quickly 
acquiring new skills. But at other times — 
when aiming to deepen their knowledge of a 
research area or to explore a new field — a 
researcher may want to devote more than a 
week or two to their pursuit, even though the 
costs of tuition, travel and accommodation are 
likely to be much higher. Four to eight weeks 
might sound like a long time to be away from 
one’s primary research, but it can provide an 
opportunity to dig more deeply into a disci- 
pline or to expand into a new field. 

Physiologist Christopher Tubbs used a 
six-week seminar at the Marine Biological 
Laboratory in Woods Hole, Massachusetts, to 
deepen his knowledge of reproductive biol- 
ogy. Tubbs, who now examines the effects of 
endocrine disruption on endangered wildlife, 
took the seminar as a doctoral student in the 
hope of learning new ways to study fish-sperm 
motility. 

At the same time that he learned more 
about fish, he acquired a new understanding 
of hormones, hormone signalling and gamete 
biology that helped him to secure his current 
post at the San Diego Zoo Institute for Con- 
servation Research in California. 

Savvy researchers can often find ways to 
bring down costs to achievable levels. Chemist 


© 2014 Macmillan Publishers Limited. All rights reserved 


want to learn the same skill. 

@ If money is tight or travel difficult, consider 
an online option. 

@ Sometimes, an in-person course is the 
only way to gain a particular skill. If that skill 
is needed at a crucial point in your training, 
you should make a good case for the time 
and expense. 

@ “When you apply for funds, you need to 
get really good at framing why that course is 
essential for you at this stage in your career,’ 
Gill says. If you can’t communicate that 
need effectively, maybe you do not need the 
course as much as you thought. S.W. 


Jennie Mayer of Bellevue College in Washing- 
ton wanted to learn how to maintain a new 
mass spectrometer and nuclear magnetic reso- 
nance spectrometer purchased by her insti- 
tute. Fortuitously, Mayer subscribed to e-mail 
updates from the American Chemical Soci- 
ety (ACS), of which she was a member, and 
received one about its six-week series of online 
chemistry courses known as Sci-Mind. These 
include tutorial videos, laboratory exercises 
and opportunities to interact with instructors 
and other students. 

At its original price, the course would have 
cost just half that of the on-site equivalent, 
and she managed to get both the members’ 
rate and a promotional discount that ACS was 
offering, with a final price tag to her institu- 
tion of about $900. The course refreshed her 
skills on how to use the instruments and 
taught her how to maintain and troubleshoot 
the spectrometers. She is now creating mod- 
ules that will help her to train students and her 
colleagues in their use. 

Asa postdoc, palaeoecologist Jacquelyn Gill 
opted to take a massive online open course 
(MOOC) in calculus because it was free and 
she could fit it into her schedule as she pleased. 
She had come to realize that she would need 
much deeper familiarity with calculus for 
her field’s increasingly quantitatively based 
research, but enrolling in a class alongside 
undergraduates felt awkward. So when the 
MOOC format became available, she leapt 
on-board and signed up for a 14-week course 
through the online platform Coursera. Each 
week, she spent just two hours working on 
problems and taking online quizzes. 

Now a faculty member at the University of 
Maine in Orono, Gill stresses that it is never 
too late to learn something new — and far too 
easy to put it off. “Just do it” she says, “and do 
itnow. # 


Sarah Webb is a freelance writer based in 
Chattanooga, Tennessee. 


PAUL KOVACH, UNIV. PITTSBURGH 


TURNING POINT 
Marta Zlatic 


Marta Zlatic, a neurobiologist at the Howard 
Hughes Medical Institute’s Janelia Farm 
research campus in Ashburn, Virginia, has 
published an atlas of the larval fruit-fly 

brain that pinpoints the neurons responsible 
for recorded behaviours. It is a major step 
towards realizing her dream of understanding 
how brains mediate behaviour. 


What were your early interests? 

Growing up in Croatia, I had fantastic Latin 
and Greek teachers. I loved grammar and 
the rules of language, learned eight lan- 
guages and enjoyed reading literature in its 
language of origin. But I was also interested 
in biology, stemming largely from wanting to 
understand the neural basis of language and 
behaviour. I studied linguistics and Russian 
at the University of Zagreb in Croatia before 
starting at the University of Cambridge, UK, 
where I was able to explore my interests in 
neuroscience, languages and theatre. 


What were your pivotal career moments? 
There were a couple. One was attending 
lectures on the assembly of neural circuits 
in fruit-fly embryos by Cambridge neuro- 
scientist Michael Bate, who inspired me to 
study the first steps of that process. I ended 
up doing both my PhD and postdoc at Cam- 
bridge. Another was receiving a prestigious 
postdoctoral fellowship from Cambridge’s 
Trinity College, which allowed me to freely 
pursue research in neural-circuit assembly at 
both Cambridge and abroad. I spent a year 
at Columbia University in New York and a 
year at Janelia Farm, where I discovered that 
colleagues were developing amazing genetic 
tools, including transgenic fruit flies in which 
you could manipulate neuron types, which 
is necessary to create a functional neuro- 
anatomy map. 


How did working at Janelia affect your career 
plans? 

My husband, Albert Cardona, and I met as 
visiting postdocs at Janelia. I was working 
out how to piece together a neural-behaviour 
map, and he was determining the viability of 
aconnectome, a map of neural activity in the 
larval fruit-fly brain. He got a job in Zurich, 
and I was hired as a group leader at Janelia. 
We came up with long-distance strategies 
such as working as visiting scientists at each 
other's institutes until Albert was hired as a 
group leader in neurobiology at Janelia in 
2011. Our aim is to put the neural behav- 
ioural map and the connectome together 


— an important step towards linking 
behaviour to the underlying brain circuitry. 


Why start with fruit-fly larvae? 

Fruit-fly larvae have only about 10,000 neu- 
rons, compared with the adult’s 100,000, so it 
is a manageable system. Plus, we could experi- 
mentally control most neuron types of the lar- 
val neurosystem using existing genetic tools. 


How did you put the atlas together? 

Janelia was one of the few places in the world 
where this research could have been devel- 
oped. We had access not only to transgenic 
fruit flies, but also to images of neuronal 
expression patterns in those larval brains. 
In collaboration with Janelia colleagues, we 
developed a system that could generate mov- 
ies of larval behaviour in 40,000 animals in 
response to stimulation of specific neurons. 
We also worked with Carey Priebe at Johns 
Hopkins University in Baltimore, Maryland, 
who created a program that could distinguish 
between 29 types of behaviour and map each 
to specific neurons. Altogether, creating the 
atlas took three years of collecting data, 
making sense of them and verifying then. 


How will the atlas affect your career? 

This is the foundation for the work I want to 
continue with Albert. We need to combine 
three maps: of neuron—behaviour, neuron 
connectivity and neuron activity. Together, 
they will lay the groundwork for a brain-wide 
understanding of behaviour. If we can one 
day understand how complex sequences of 
actions are generated and learned in model 
organisms, maybe we could look for similar 
principles in humans. = 


INTERVIEW BY VIRGINIA GEWIN 
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WORK-LIFE BALANCE 


Learn from leisure 


All work and no play could make for a 
mediocre researcher, according to a study 
(K. J. Eschleman et al. J. Occup. Organ. 
Psychol. http://doi.org/sfg; 2014). Workers 
who also engage in creative activities such 
as painting or playing music perform more 
effectively in their job, says lead author 
Kevin Eschleman, a psychologist at San 
Francisco State University in California. 
Creative activities outside work also help 
employees to bounce back from stress and 
to learn new skills, he says. Eschleman and 
his co-authors surveyed 433 US workers in 
various occupations, including 92 active- 
duty US Air Force captains. Eschleman 
advises scientists to take up leisure 
activities to help their conscious mind to 
detach from workplace pressures and their 
subconscious to find different approaches 
to their research. 


UNIONS 
Adjunct action 


A concerted effort by an international 
union to link up adjunct US faculty 
members is succeeding in its city-by- 

city approach. The Service Employees 
International Union has enlisted about 
70% of adjunct faculty members in the 
Washington DC area. Malini Cadambi, 

the union's higher-education campaign 
director, says that 80% sign-up is possible. 
The union last year launched similar 
strategies in Washington state and 
Massachusetts, and is expanding to seven 
more US regions. A wide membership base 
provides adjunct faculty members with 
more bargaining power in their push to 
raise pay, make evaluations fairer and boost 
academic freedom, says Cadambi. 


REMUNERATION 
Narrowing pay gaps 


Gaps between vice-chancellors’ and 
professors’ pay rises reported in the Times 
Higher Education 2012-13 salary survey 
were highlighted as UK faculty members 
negotiated contracts for the upcoming 
academic year. The survey found that 
professors salaries rose by an average of 
just 1%, to £75,472 (US$126,782), whereas 
vice-chancellors’ pay (including some 
benefits) jumped by 5.5% to £232,120. This 
year, professors have been offered a 2% 
rise, says Jenny Sherrard, a spokeswoman 
for the University and College Union in 
London. “It’s not acceptable for those at the 
top to be receiving higher percentage pay 
rises than are on offer to the rest; she says. 
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Ua SCIENCE FICTION 


KILLSTEALER 


BY ALTER S. REISS 


CC ill you put that game 
down!” snapped Steve, 
and Brendan turned 


to his head, looking surprised 
and hurt. 

Steve was immediately sorry. 
He couldn't yell at what was hap- 
pening, but snapping at the kid 
it was happening to wasn't fair. 
Brendan was just 12, old enough 
to understand what was 
going on, but not old enough 
to... not old enough for any- 
thing. He looked like he hadn't 
slept; bruises under his eyes, 
skin like paper. He looked like 
he was dying. 

“Sorry, said Steve. “Just... stay 
with me for a little bit, okay?” 

“Okay,” said Brendan. “But let me 
wrap things up. My friends will —” and 
he looked back at his game and started 
making excuses into his headset. 

Friends, Steve thought. Like the kid had 
any real friends — just people on the net. I 
should’ve pushed harder, with school, with 
summer camp, things like that. Too late now. 
He breathed, tried to focus. His job was to be 
Brendan's father, for however much longer 
Brendan had. 

“There. What is it, Dad? What’re you 
printing?” 

There was the click and whirr as the 
printer finished its job. Best system on the 
market — laser sintering for four metals, 
14 elements total for non-metallic compo- 
sitions. Should have spent his time with his 
kid instead of with his toys. “Bots,” he said. 
He took out the capsule it had printed, and 
passed it over to Brendan. “Here, take this.” 

Brendan frowned, but took it and swal- 
lowed it down with a cup of water. Hed been 
given a lot of pills to take since they'd found 
the problem, but he was never a complainer. 

By the time Steve had the software set up, 
the bots were already in Brendan’s blood- 
stream, and the boy was leaning on his shoul- 
der, interested. “Thought it might help to see 
what's happening,” said Steve, which wasn't 
exactly true. The book hed been reading rec- 
ommended showing kids what was going on, 
and he thought that sounded reasonable. 

The screen revealed a confusion of shapes, 
all moving at different speeds. “That’s your 
blood, you see,’ he said. 

“And those are the cancer cells?” 
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A shot inthe dark. 


Steve winced, wishing Brendan hadn't 
used that word. “No,” he said. “The ones 
outlined in red are the red blood cells. 
There, that one in yellow is a white blood 
cell. Some of them are the ... some of them 
aren't working right.” 

“Like that one?” 

“Yeah,” Steve said. “The hairy-looking 
ones are the bad cells. And check this out.” 
He pressed some buttons, and the bot went 
forward, into the cell, and it fell apart. 

“Cool.” Brendan slid in next to him, took 
over the keyboard. 

“If we could do chemo, the bone marrow 
would stop making —” 

“But we can't, said Brendan. “Because 
of my liver. And we tried other things that 
didn’t work. So I'll pop them” 

Steve shook his head, trying not to cry. It 
had seemed reasonable in the book, but it 
wasnt going to be easy. “There are too many 
of them to get. Tens of millions. But you can 
get a few, give them a fight.” 

“Thanks, Dad; said Brendan, settling in 
more comfortably. “Maybe I'll get some of 


my friends to help — 
> NATURE.COM are these things net- 
Follow Futures: worked?” 
WS @NatureFutures “Tm not sure,” said 


Ei go.nature.com/mtoodm © Steve. He wanted to 
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stay. There wasn't anything he could do, but 
he wanted to stay. Only, while their insur- 
ance was rubbish, if he lost his job, itd be 
gone too. “Look, I have to get to 
work; if you find a better pattern, 
you can print it out.” 
“Thanks,” said Brendan, 
but already he wasn't listen- 
ing. Trade one game for 
another, but the book 

Steve had been reading 

recommended it asa first 

step, and it seemed to get 

Brendan interested. 

When Steve got back 
home, Brendan was still 
popping cancerous lym- 
phocytes. Maybe it was a 
mistake, letting him think 
that he'd be able to beat 
leukaemia one cell at a time. 

The idea in the book was that 

Brendan would get frustrated 
and stop, and get angry and come 
back to it, and eventually get used to 

the idea of how big the problem was. 

“How's it going?” Steve asked. 

“Oh, that’s not me,’ said Brendan, pull- 
ing back, and giving his father a half smile. 
“Once they started getting bonus points for 
the spawn sites —” 

“The bone marrow?” 

“Yeah, the screwed up bone marrow. Even 
with the cascading bonus for the last few, 
you can't really get that many points work- 
ing on me. There'’re still some guys going for 
those bonuses, but ’'m working on some guy 
in Fiji who just started printing.” 

Steve tried to process. “There are tens of 
millions of cancer cells in your blood, he said. 

“Were,” said Brendan. “My friends got the 
interface and the targeting better, and their 
friends wanted to help too. There’s a leader- 
board and there are a lot of people playing” 

“You're better?” asked Steve, turning 
Brendan around for a closer look. He still 
looked like he hadn't slept, but his eyes were 
clear, and maybe his skin was even a little 
better. 

“Yeah, but this guy ... Dad, you're crush- 
ing me. Dad! Oh, man, someone’s stealing 
my kills, Dad!” = 


Alter S. Reiss is a field archaeologist and 
scientific editor. His fiction has appeared 
in Fantasy & Science Fiction, Strange 
Horizons, Daily Science Fiction and 
elsewhere. 
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